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To Richard and Roger 


Preface 


The opportunities for doing experiments at synchrotron and neutron 
facilities have grown rapidly in recent years, and are set to continue 
into the foreseeable future. Powerful third generation X-ray sources, 
comprising the APS in Chicago, the ESRF in Grenoble and SPring-8 
in Hyogo, which became operational in the 1990s, are being joined 
by an increasing number of new regional facilities; DIAMOND and 
SOLEIL, for example, to name but two. Similarly, the availability of 
neutrons is being enhanced by the upgrade and expansion of estab- 
lished facilities, like the ILL in France and ISIS in England, and the 
commissioning of next generation sources, such as the SNS in the 
United States and J-PARC in Japan. 

The early years of X-ray and neutron scattering were dominated 
by physicists, who traditionally had a strong background in the rel- 
evant theory and mathematics needed to understand the nature of 
the experiments; erudite texts, such as Lovesey (1986), could be con- 
sulted when details were required. The recent rise in the acces- 
sibility of synchrotrons and neutrons has been accompanied, if not 
driven, by a shift in emphasis from the study of hard condensed mat- 
ter to the investigation of biological samples and soft matter. As a 
result, the demographics of the users at the respective facilities has 
been changing rapidly: an increasing proportion are biologists and 
chemists, for whom the scholarly physics-oriented works are gener- 
ally too demanding to be helpful. This book aims to provide a gentler 
introduction to the subject, to cater for those who are less confident 
in their mathematical ability and physics background. 

It could be argued that a knowledge of scattering theory is not 
necessary for being a user at a synchrotron or neutron facility these 
days, because the techniques are sufficiently well-developed to be 
treated as standard tools where the required data analysis is carried 
out with tried-and-tested computer programs. While there is truth 
in this viewpoint, a basic understanding of the underlying theory is 
always intellectually fulfilling and useful: it educates us about what 
can be inferred reliably from the measurements and, hence, influ- 
ences which experiments are done. It is also important to remember 
that no analysis program is ever fool-proof or free from assumptions 
and approximations; consequently, it is advantageous to have some 
idea of how the data are related to the object of interest so that the 
numerical results can be checked against common sense. 

Although mathematics cannot be avoided in a theoretical discus- 
sion, my aim has been to write a book that most scientists will find 


vill Preface 


approachable. To this end, the first two chapters are devoted to pro- 
viding a tutorial on background material that is implicitly assumed 
in other texts. Thereafter, the philosophy has been one of keeping 
things as simple as possible: clarity and understanding have been 
favoured over a comprehensive treatment, through the omission of 
topics and detail that entail too much complexity. Despite my best 
efforts, some students will still find this book quite daunting. If 
they can get something out of it, then I will consider the endeavour 
to have been worthwhile. 

Finally, it is a great pleasure to acknowledge the help and encour- 
agement that I received from a number of friends and colleagues. 
First and foremost, Doryen Bubeck, Steve Collins, Bill Hamilton, 
Jerry Mayers, Jeff Penfold, Dave Waymont and John Webster, who 
gave me useful feedback on many of the chapters. At various points, 
I’ve also picked the brains of Nikitas Gidopoulos, Stephen Lovesey, 
Roger Pynn and Jorge Quintanilla. Assistance and advice on indi- 
vidual topics and chapters was given by Peter Battle, Iain Campbell, 
Marilyn Hawley, Steve King, Kevin Knight, Stewart Parker, Owen 
Saxton and Mona Yethiraj. Many thanks go to my Editor, Sonke Ad- 
lung, for his patience; and to Emma Lonie, April Warman and other 
staff at the Oxford University Press. Last but not least, I am grate- 
ful to Unilever R&D for their generous sponsorship of this book, and 
to Ian Tucker for his help in securing it. 


Oxford 
July, 2010 D.S.S. 
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Part I 


Some preliminaries 


Studying maiter at the atomic 
and molecular level 


The world around us is varied and complex but, as scientists, we 
hope to make sense of it by trying to understand the nature of its 
underlying constituents. The notion that the universe is made up 
of basic building blocks goes back to antiquity, with the word ‘atom’ 
itself coming from the Greek atomo meaning ‘indivisible’. Although 
this idea was put on a firm footing by chemists such as Dalton and 
Mendeleev in the nineteenth century, a theoretical picture of atoms 
did not emerge until the work of physicists like Thompson, Ruther- 
ford and Chadwick in the early part of the twentieth century. While 
subsequent experiments in high energy physics have revealed a rich 
subatomic structure of quarks and so on, our interest here is with 
the study of condensed matter (liquids and solids) at the atomic and 
molecular level. 


1.1 Length scales and logarithmic axes 


The SI, or Systéme International, unit of length is a metre and has 
a standard abbreviation of ‘m’. It is a sensible measure for every- 
day objects, such as people, because their size tends to be of this 
order. The breadth of a human hair, which has traditionally been 
used as a metaphor for something extremely narrow, is about ten 
to twenty thousand times smaller than a metre. Rather than using 
the fractional notation of 1/20000 m or 0.00005 m, which becomes in- 
creasingly cumbersome as the numbers involved get very small (or 
large), we tend to write it as 5 x 10~-°m or 0.05 mm or 50 um, where 
the abbreviated prefixes of ‘w (for milli) and ‘w (micro) represent a 
multiplication by 1073 and 107° respectively. 

The most common SI prefixes are listed in Fig. 1.1, which shows 
the corresponding magnitude factors drawn on a logarithmic scale. 
Whereas uniform increments on a linear (ordinary) axis denote the 
addition of a constant at each step, they indicate a multiplication 
by a given scale-factor on a logarithmic one; this difference is illus- 
trated in Fig. 1.2. In the case of Fig. 1.1, each (major) tick mark cor- 
responds to a thousand-fold increase in magnitude over its predeces- 
sor on going up the axis. Everyday examples of logarithms include 
musical scales, where successive octaves double the frequency of a 
note, the Richter scale for indicating the severity of an earthquake, 


1012 Tera 
10° Giga 
10° Mega 
10° kilo 
Hecto 
deca 
io (1) 
deci 
centi 
1079 milli 
1078 micro (4) 
1 0o79 nano 
io" pico 
10715 femto 


Fig. 1.1 The standard SI prefixes. 
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Fig. 1.2 The difference between linear and logarithmic axes. 


the decibel scale for reporting the power gain of a hi-fi amplifier, the 
decay of a radioactive sample as specified by its half-life, the growth 
of a bacterial colony as defined by its ‘doubling-time’, the pH scale 
for acidity and so on. 


Powers, roots and logarithms 


If a number, say a, is multiplied by itself, then we can write it as a’ and 
call it a-squared. A triple product can be written as a? and is called a- 
cubed. In general, an N-times self-product is denoted by a where the 
superscript, or index, N is referred to as the power of a. 

Although the meaning of ‘a to the power of N’ is obvious when N is a 
positive integer (i.e. 1,2,3,...), what happens when it’s zero or negative? 
This question is easily answered once we notice that the procedure for 
going from a power of N to N—1 involves a division by a. Thus if a° 
is a’ divided by a, then a to the power of nought must be one (or unity); 
similarly, if a~" is a° divided by a, then it must be equal to one over a; and, 
in general, a ™ is equivalent to the reciprocal of a’. Thus, we have 


ao =1 and a =—. (1.1) 


The basic definition of powers leads immediately to the formula for 
adding the indices M and N when the numbers a” and a are multiplied: 


Ga Sat (1.2) 


While our discussion has so far focused only on the case of integer powers, 
suppose that we were to legislate that eqn (1.2) held for all values of M 
and N. Then, we would be led to the interpretation of fractional powers 
as roots. To see this, consider the case when M=N=1/2; it follows from 
eqn (1.2) that a to the power of a half must be equal to the square root of a. 
Extending the argument slightly, if a to the power of a third is multiplied 
by itself three times then we obtain a; therefore, a'/* must be equal to the 
cube root of a. In general, the p'" root of a is given by 


a’? = Wa (1.3) 


where p is an integer. One final result on powers that we should mention 
is 
(a")" = aM , (1.4) 


which can at least be verified readily for integer values of M and N. 
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An alternative method of describing a number as a ‘power of something’ 
is to use logarithms. That is to say, if y is written as a to the power of x 
then z is the logarithm of y to the base a: 


p= ao = z = kew) (1.5) 
where the double-headed arrow indicates an equivalence, so that the ex- log,,(10°) = 0 
pression on the left implies the one on the right and vice versa. Since we log,,(10') = 1 
talk in powers of ten in everyday conversations (e.g. hundreds, thousands, log,,(107) = 2 
millions), the use of a= 10 is most common; this gives rise to the name 
common logarithm for log,,, often abbreviated to just log (but this can be In(e°) = 0 
ambiguous). Other bases that are encountered frequently are 2 and ‘e’ ln(e) = 1 
(2.718 ...); log., or In, is called the natural logarithm. ln(e) = 2 
By combining the definition of the logarithm in eqn (1.5) with the rule 
of eqn (1.2), it can be shown that the ‘log of a product is equal to the sum exp(z) = e” 


of the logs’ (to any base); and, in conjunction with eqn (1.1), that the ‘log 
of a quotient is equal to the difference of the logs’: 


log(AB) = log(A)+log(B) and  log(A/B) = log(A) — log(B). (1.6) 


Similarly, eqn (1.5) allows us to rewrite eqn (1.4) in terms of the log of a 
power and to derive a formula for changing the base of a log (from a to b) 


log(A’) = Blog(A) and log,(A) = log,(A) x log,(a). (1.7) 


Thus ln(x) = 2.3026 log; (x), where the numerical prefactor is In(10) to 
four decimal places, and so on. 


While hair may be extremely narrow, it is made of cells that are 
about ten times smaller; at around 10 um, they are not visible (or dif- 
ferentiable) to the naked eye. The simplest single-celled organisms, 4 Grain of salt 
or prokaryotes such as bacteria, are typically a tenth of the size of the 
average human cell; that is to say, about a micrometre (or a micron). 


mm — Pin head 


+ Human cells 


Viruses, which are little more than packets of genetic material sur- jiu —| Bacteria 
rounded by a coating of proteins, are a magnitude smaller at around 
0.1 um or 100nm. Whether it be viruses or organelles within cells, J Viruses 


such as ribosomes where the genetic code is translated into pro- 
teins, they are essentially assemblies of large molecules. Although 
these macro-molecules can be very long if stretched out like a string, nm — Molecules 
with the human genome being 1.8m for example, they tend to be 
twisted, folded, packed and clumped into units of order 10nm. The 
nucleic acids (DNA and RNA), proteins and complex carbohydrates 4 
are themselves made up of building blocks of smaller molecules (the 
nucelotide bases, amino acids and simple sugars), with dimensions 
of a nanometre. | 

The shortest length scale of interest to us pertains to inter-atomic 
distances, which are of order 0.1 nm or 100 pm. We often refer to it J 
by a special unit named after the physicist Ångström (1814—1874): 


-J Macro-molecules 


R- Atoms 


fm — Nuclei 


A — —10 — — 
LA = 10 m = 0.lnm = 100pm. Fig. 1.3 Physical length scales. 
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The nuclei of atoms, and the neutrons and protons of which they con- 
sist, are ten to a hundred-thousand times smaller than an angstrom: 
around 10715m or a few femtometres. The main point to note here 
is that the nuclei are point-like when compared with the dimensions 
of atoms. 


1.2 Resolution, magnification and 
microscopy 


Atoms cannot be seen with the naked eye because they’re too small, 
but what limits our ability to see objects below a certain size? It’s 
due to the wave-nature of light, which bends around edges, such as 
the sides of the pupils in our eyes, and causes us to see a slightly 
blurred view of the world. This effect can be noticed with large 
televisions where the individual pixels can be distinguished when 
standing close to the screen but not on moving further away; they 
appear to run into each other and merge to give a seamless picture. 
This observation tells us that the limit on seeing detail depends on 
the angular size of the object rather than its intrinsic dimensions. 
While the diameter of the sun is 400 times larger than that of the 
moon, for example, it has the same projection of about 0.5° on the 
sky because it’s also 400 times further away. 

For any optical device or imaging system, the smallest angular 
scale, 60, that can be resolved is given by 


(1.8) 


Angles and circular measure 


An angle is a measure of rotation, or turn, and is usually specified in 
degrees. There are 360° in a complete twist, where one ends up facing the 
same way as at the beginning; 90° therefore represents a right-angle, 180° 
an about-face, and so on. Despite our familiarity with degrees, a circular 
measure that is often more useful in a mathematical context is a radian. 
It is a dimensionless quantity defined as follows: if a radius of length R is 
spun through an angle 0 and generates an arc of length L, then 


? = R (1.9) 
Since the circumference of a circle is 27 R, 360°= 27 radians; a right-angle 
is 7/2 and, in general, an angle in degrees can be converted into one in 
radians by multiplying it by 7/180. Unless degrees are mentioned ex- 
plicitly, it is best to assume that all angles are implicitly given in radians 
(especially if they contain factors of 7). 
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where à is the wavelength of the light, D is the size of the aperture 
through which the observation is made and 06 is in radians. Taking 
the wavelength of light to be 600nm and the diameter of the pupil 
as 2mm, eqn (1.8) yields a resolution limit of 


_ 600 x 10-9 


x = 1076 ~ 0.017° 
50 ESTEE 300 x107 rad = 0.017°, 


or about one minute of arc, for the naked eye. Given that most people 
have difficulty in focusing on objects that are closer than 20 cm from 
their eyes, we can use the angular definition of eqn (1.9) to relate 
this 69 to a minimum resolvable size of around 60 um. Since this is 
roughly the breadth of a human hair, there is some wisdom behind 
the saying ‘as narrow as a hair’s breadth’! 

We should emphasize that the above analysis does not imply that 
things that are much smaller that 60 um cannot be detected with 
the naked eye. If they occur in isolation, and are sufficiently bright 
or well-lit, they should be visible. If two are closer together than 
60 um, however, we will not be able to distinguish them as separate 
entities; this loss of detail is the significance of resolution. The for- 
mula of eqn (1.8) is based purely on the diffraction of light through 
an aperture and represents the best limit achievable in theory; the 
resolution attained in practice can be worse for a variety of reasons. 
Poor lighting conditions lead to a larger 60, for example, even though 
the pupils then expand to D ~ 8mm. Likewise, a failure to wear 
spectacles, if required, degrades the resolution. 

Things that are too small to be seen easily with the naked eye 
can be made more accessible, of course, with the aid of a magnifying 
glass. Although references to magnifiers and ‘burning glasses’ are Radio waves 
found in the writings of Pliny the Elder (23—79 AD), and Egyptian 
artefacts from 640 BC include rock crystals in the shape of convex 
lenses, their use was very limited until the invention of spectacles 
towards the end of the thirteenth century. The greatest magnifica- 


Microwaves 


mm 
tion from a lens is attained when the object of interest is placed a dis- 
tance d from it that is comparable to its focal length f. Since a lens 
also constitutes an aperture whose diameter cannot be larger than 
2f (which would result from a spherical lens!), it yields a diffraction i 
limit on the smallest resolvable size, ôL, of around half the wave- Light 
length of light: : ‘ Heast 
ôL x ~xd > —~xf. 

D 2f nm 

While microscopes are more complex than simple magnifying glasses, R aTa 


the limit given by this elementary argument still holds: the shortest 

length scale that can be resolved by the best optical instruments is 

of order 0.2 um (with à ~ 400 nm for blue light). pm 
The above analysis tells us that, to see finer detail, we have to use 

light of a shorter wavelength. In particular, to achieve atomic res- 

olution, we must move beyond the visible and ultraviolet regions of Fig. 1.4 The electromagnetic 

the electromagnetic spectrum (Fig. 1.4) to X-rays. There is a serious spectrum. 


Gamma rays 
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Fig. 1.5 A poliovirus, 30 nm across, as 
seen by a transmission electron micro- 
scope. (Bubeck et al., 2005) 


All these fifty years of conscious 
brooding have brought me no nearer to 
the answer to the question, ‘What are 
light quanta? Nowadays every Tom, 
Dick and Harry thinks he knows it, but 
he is mistaken. 

(Albert Einstein, 1954, writing to his 


friend Michael Besso.) 


problem, however, in that it’s very difficult to produce good lenses 
for light of such short wavelengths; hence, X-ray microscopy is not a 
practical way forward. 

A solution to the quandary of having very short wavelength ra- 
diation that can be focused easily is provided by the electron: as a 
charged particle, its trajectory can be controlled with both electric 
and magnetic fields; thanks to quantum mechanics, it also exhibits 
wave-like behaviour. Electrons are readily produced by a hot wire 
and then accelerated to the desired speed by a potential difference 
of V volts; the only substantial requirement is that of a vacuum, to 
avoid energy losses through collisions with air molecules. The asso- 
ciated wavelength is given by eqn (1.13) as 


h 133x10 
—— > — M 
V2 meV VV i 
where the momentum p of the electron has been related to its mass 
Me = 9.11 x 107t kg and charge e = 1.60 x1071° C through the con- 
servation of energy (electrostatic to kinetic) in the classical limit: 


p’ 


2Me 


A= (1.10) 


eV = 


(1.11) 


Wave-particle duality and quantum mechanics 


We are familiar with the difference between the interactions of discrete 
objects compared with those of waves. Two colliding balls, for example, 
either bounce off each other or stick together; ripples on a pond, on the 
other hand, simply pass through each other unaffected but combine to 
give an interference pattern while overlapped. This everyday experience 
of the distinct characteristics of waves and particles does not seem to hold 
at the microscopic scale: objects that behave like particles in one situa- 
tion display phenomena associated with waves in another. Although the 
nature of light has been a source of much debate since the 17th century, 
with Hooke supporting Huygen’s wave theory while Newton regarded it as 
a tiny stream of particles (‘corpuscles’) travelling at high speed in straight 
lines, the inherently dual nature of matter at the smallest scales did not 
become generally apparent until the late 19th and early 20th centuries. It 
led to the birth of quantum mechanics. 

The wave—particle duality of nature is captured at the most elementary 
level by Planck’s hypothesis (1901) that an oscillation of frequency v is 
associated with a packet, or ‘quantum’, of energy E given by 


E = hv, (1172) 


where h=6.626 x10 ** Js is called Planck’s constant, and de Broglie’s pro- 
posal (1923) that a particle with momentum p is associated with a wave- 
length given by 


`=. (1.13) 
P 
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Only a hundred volts are needed, therefore, to accelerate electrons 
to wavelengths comparable to atomic length scales. More energetic 
electrons are used in practice (~ 200 keV), to enable them to pass 
through thicker samples and, perhaps surprisingly, to reduce the 
radiation damage. The latter imposes a serious limitation on the im- 
age resolution attainable with biological materials, because they de- 
grade rapidly with exposure, and it is lower than that for inorganic 
samples by an order of magnitude. An image of a 30nm poliovirus, 
with the best resolution currently achievable (~10 A), is given in Fig. 
1.5; and an inorganic example, where individual atoms can just be 
differentiated, is shown in Fig. 1.6. 

The electron microscopy considered above is the analogue of its op- 
tical counterpart, where the illuminating radiation passes through 
a suitably thin sample. An alternative to this ‘transmission’ mode 
of operation (TEM) is a ‘scanning’ version (SEM), where a highly 
focused beam of electrons is directed at successive small areas of 
the surface and a map constructed from the returned signal (prin- 
cipally of secondary emissions). Although the resolution achieved 
with SEMs is about an order of magnitude poorer than that of TEMs, 
the resultant images have a good depth of focus and are much easier 
to interpret. 

An even more direct approach to the study of surfaces is provided 
by scanning probe microscopes (SPMs), where an extremely sharp 
probe is raster-scanned in close proximity to the sample and topo- 
graphic maps constructed from a variety of interaction signals in- 
duced in the tip. Scanning tunneling microscopes (STM) were the 
first of this type of instrument, being developed in 1981, but many 
different kinds of atomic force microscopy (AFM) now exist. SPMs 
typically achieve a resolution in the nm to A range, can be oper- 
ated in a wide range of environments and temperatures, and work 
equally well for metals and soft materials. An example of an STM 
image is given in Fig. 1.7, which shows a rectangular array of atoms 
on a (clean) surface of silicon. 


1.3 Structure, dynamics and spectroscopy 


So far we have been thinking about looking at matter in the literal 
sense of an image. As in the case of an ordinary photograph, what 
appears like a frozen snap-shot from a potentially moving scene is 
actually a time-averaged view of the world. If the inherent motion 
is small compared with the resolution of the imaging system, or the 
equivalent of the camera’s shutter-speed is sufficiently fast, the pic- 
ture will be sharp; otherwise, it will be a blurred rendition of the 
object of interest. In either case, we can gain additional information 
on the nature and behaviour of matter at the atomic and molecular 
level by probing the dynamics on these length scales. This is the 
realm of spectroscopy. 


Fig. 1.6 A transmission electron micro- 
graph of cadmium sulphide. (Courtesy 
of Dr. J. H. Warner, Department of Ma- 
terials, Oxford.) 


Fig. 1.7 An STM image of atoms on 
the surface of silicon, 3.84A and 7.68A 
apart, taken under ultra-high vacuum 
conditions. (Courtesy of Dr. M. E. Haw- 
ley, Los Alamos National Laboratory.) 
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Fig. 1.8 A transition between two en- 
ergy levels, E and E2, and the corre- 
sponding spectral line of frequency v. 


The earliest form of spectroscopy consisted of the production of the 
colours of the rainbow when sunlight passed through a prism. While 
this had been known since antiquity, it was Newton (in 1666) who 
first realized the significance of this spectrum in terms of what it 
told us about the nature of light. With improving instrumentation, 
a large set of narrow dark lines became discernible in this spectrum 
by the early nineteenth century. Although Fraunhofer never un- 
derstood the origin of these spectral lines that now bear his name, 
Kirchhoff subsequently discovered that their patterns were a unique 
signature of the chemical composition of the light source. Indeed, 
when the dark ‘absorption’ lines from the spectrum of sunlight were 
compared with the corresponding bright ‘emission’ ones obtained 
on placing different atoms and molecules in a (Bunsen) flame, an 
anomaly was found that led to the prediction (by Lockyer, in 1870) 
and later confirmation (by Ramsay, in 1895) of helium! It’s named 
after the Greek sun god, Helios. 

A study of the systematic patterns in the wavelengths of the spec- 
tral lines from elements such as hydrogen, by Balmer and others in 
the late 19th century, led to Bohr’s quantum model of the atom in 
1913. It shared Rutherford’s picture of a compact positive core sur- 
rounded by orbiting negatively charged particles, much as planets 
going around the sun, but differed through its additional proposal 
that the angular momentum (and, hence, energy) of the electrons 
could only take certain well-defined values. The spectral lines then 
arose from transitions between the allowed energy levels (Fig. 1.8), 
with absorption resulting in an increase in the electron’s energy and 
an emission occurring due to a corresponding decrease. Although 
this simple model was able to explain the spectra of hydrogen-like 
atoms (nuclei with a single valence electron), it failed to generalize 
to more complex cases. Nevertheless, it serves as a useful historical 
example of how structural information can be gleaned from spectro- 
scopic measurements. 

By the early 1800s, Herchel and Ritter had demonstrated that 
the visible spectrum was only a small portion of the light coming 
from the sun. Subsequently, absorption and emission lines were also 
found at ultraviolet (UV) and infrared (IR) wavelengths; what can be 
learnt from them? The higher frequency, or shorter wavelength, UV 
spectra give additional information on the energy levels of the va- 
lence electrons whereas the longer wavelength, or lower frequency, 
IR spectra tell us about molecular vibrations. Since the latter re- 
late to the stretching and bending of bonds between atoms, IR spec- 
troscopy gives an indication of the connectivity in molecules. Not 
only is it helpful in identifying the presence of particular bonds, but 
clues about their environments can also be obtained: the stretching 
frequency of the C=O carbonyl group, for example, varies slightly de- 
pending on whether it occurs in an aldehyde, a ketone, a carboxylic 
acid, an ester or an amide. Thus, structure and dynamics are inti- 
mately linked. 
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A simple example of quantization 


If we pluck a string on a guitar, it vibrates and produces a note. What 
determines its frequency? Well, it’s governed by the length, thickness and 
tension of the string; and, to a lesser extent, the nature of the plucking. 
The principal variable at the musician’s disposal during a performance, 
therefore, is the length L, which can be changed by pressing a finger down 
on the relevant fret. 

The simplest vibration of the string consists of an up-and-down motion 
of a sine-shaped displacement which is constrained to be immobile at the 
ends; the n=1 case in Fig. 1.9 shows a snap-shot. This is called the funda- 
mental harmonic, and has a wavelength \=2L. Higher-order (sinusoidal) 
harmonics, which have shorter wavelengths, can also be excited but must 
fulfill the condition that 

IL = mAN (1.14) 


where n =1,2,3,..., to satisfy the immobility constraint at the two ends. 
This leads to the quantized normal modes shown in Fig. 1.9. 

The same mathematics arises when considering the case of a particle 
of mass m trapped in a one-dimensional potential well of length L from 
a quantum mechanics point of view. The allowed de Broglie wavelengths 
of eqn (1.14) are then related to momenta pn = h/An, where An = 2L/n, 
through eqn (1.13) and lead to the quantization of the associated kinetic 
energy En = p2/2m: 

n h? 
Pemi 
E] 


(1.15) 


n 


Absorption and emission spectroscopy can be extended even fur- 
ther from the visible spectrum, to microwave and X-ray wavelengths. 
While X-rays are related to transitions involving the deep electronic 
states of atoms, microwaves are associated with the excitations of 
molecular rotations (the basis of heat transfer in microwave ovens). 
There are many other forms of spectroscopy, most notably Raman 
and nuclear magnetic resonance (NMR), which provide information 
that is complementary to IR and microwave spectroscopy. Neutron 
scattering also has a role to play, albeit a niche one, as will be dis- 
cussed in Chapter 9. 


1.4 Atomic building blocks and interactions 


We noted earlier that X-ray microscopes were not feasible because 
good enough lenses were not available for light of such short wave- 
lengths. This lack of imaging hardware does not prevent us from 
examining how a beam of X-rays is scattered by a sample however, 
but what can we learn from such experiments? This question is at 
the heart of this book. To address it, we must first remind ourselves 
of the basic structure of atoms and their interactions. 


Fig. 1.9 The normal modes for a string 
of length L, fixed at both ends. 
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4 X4 is the ionic isotope of element 
X, defined by its atomic number z, 
of mass u and charge q. 


1.4.1 The basic structure of the atom 


In terms of the material in this book, and science in general, atoms 
can be viewed as being composed of three constituents: electrons, 
protons and neutrons. Although protons and neutrons are no longer 
considered to be fundamental, in the sense that they are made up 
of quarks, such subatomic subtleties are important only for particle 
physicists and cosmologists. So what are the basic properties of elec- 
trons, protons and neutrons, and how are they put together to form 
atoms? 

As first suggested by Rutherford in 1911, a small and very dense 
nucleus lies at the heart of an atom. Its radius is ten to a hundred 
thousand times smaller than that of the atom (several femtometres 
rather than an angstrom), but it contains nearly all the mass. The 
nucleus is a conglomeration of positively charged protons and neu- 
tral neutrons; both have a similar mass, around 2000 times that of 
an electron. A ‘diffuse cloud’ of negatively charged electrons sur- 
rounds the nucleus, at a large distance from it, and gives the atom 
its overall size. 

The element species is determined by the atomic number, which 
is the number of protons in the nucleus. This is also the number of 
electrons in a neutral atom. A loss or gain of orbital electrons yields 
an atom having a net positive or negative charge, respectively, and 
the resultant entity is called an ion. Neutrons play a role in the 
stability of the nucleus, and contribute about half the mass of the 
atom: the atomic mass is proportional to the total number of protons 
and neutrons. Atoms with the same number of protons but different 
numbers of neutrons are called the isotopes of an element, and are 
usually distinguished by specifying their atomic mass; for example, 
235U and 7°5U for uranium. The isotopes of the simplest element 
(with just one proton) have been given special names: 'H, which is 
the most abundant and lightest form (having no neutrons), is plain 
hydrogen; 7H, which is twice as heavy (with one neutron), is called 
deuterium; °H, the heaviest ‘stable’ variant (with two neutrons), is 
tritium. 

As well as the familiar properties of mass and charge, our three 
building blocks of nature have the additional quantum attribute of 
spin; this is difficult to visualize physically because it has no classi- 


Table 1.1 Attributes of the three principal constituents of atoms. 


Particle Charge Mass Spin 
(x10719 C) (x107? kg) 

Electron — 1.602177 0.000911 4 

Proton +1.602177 1.672622 4 

Neutron 0 1.674927 4 
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cal analogue. Electrons, protons and neutrons all have a spin of 1/2. 
Particles with half-integral spin (1/2, 3/2, 5/2, ...), such as these, 
are called fermions; only one can occupy any given quantum state. 
There is no occupational restriction for particles with integer spin 
(0, 1, 2,...), which are known as bosons. 

Although we no longer picture atoms as miniature solar systems 
with electrons going around the nucleus, quantum mechanics does 
yield a distinct set of allowed spatial distributions for the electrons 
called orbitals. They adopt the lowest energy states first but, being 
fermions, are forced to take up residence in ever higher ones as their 
number increases. The electrons in the outermost orbitals form the 
point of contact between atoms and determine how they combine to 
form molecules; this is the basis of chemistry. 


1.4.2 The fundamental forces of nature 


The matter that makes up the world around us interacts through 
four fundamental forces: gravity, electromagnetism and the weak 
and strong nuclear forces. The first two act on everyday and astro- 
nomical objects, and have a strength that falls off with distance r 
according to the familiar 1/r? inverse-square law. Gravity is by far 
the weaker of these interactions, as can be ascertained by compar- 
ing the magnitudes of the electrostatic and gravitational forces on a 
pair of isolated protons: 


Fae _ €?/ (Amer?) e? 


Fotav Gm3 Ir? ATE Gm? 


~ 1036 


where e and m, are the charge and mass of the proton, respectively, 
G is Newton’s universal constant of gravitation and €o is the permit- 
tivity of free space. Given the overwhelming strength of the elec- 
trostatic force, it can seem surprising that the motions of the stars, 
planets and galaxies are governed entirely by gravity. The reason is, 
of course, that charges tend to cancel out on the large scale, to leave 
essentially neutral bodies, whereas mass only adds up, to yield a 
cumulatively dominant attraction. 

Although gravity reigns supreme on a cosmological scale, electro- 
magnetism is the principal driving force behind the chemistry of life 
and the physics of everyday objects. It controls how charged objects 
interact with each other and with electric and magnetic fields. At 
the nuclear level, however, it cannot explain the existence of atoms, 
because a conglomeration of protons should blow itself apart due to 
the repulsive electrostatic force between charges of the same sign. 
This must be overcome by an even stronger attractive force which is 
effective on length scales of femtometres; it is known as the ‘strong 
nuclear force. 

The neutrons in the nuclei of some atoms spontaneously disinte- 
grate into an electron, which is ejected as (-radiation, a proton and 
an antineutrino (7). Neutrinos are ghostly particles, with no charge 


Fig. 1.10 One of the d-shell electron 
orbitals pertinent to transition metals. 


G = 6.674x10 71! Nm’ kg? 
€o = 8.854 x107" Fm! 
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Table 1.2 Attributes of the four fundamental forces. 


Force Strength Range Mediator Charge Mass Spin 


(relative) (m) (e) (kg) 
Gravity 1 oo Graviton 0 0 2 
Electromagnetic 1086 ore) Photon 0 0 1 
Weak 107° 10715 ZW O21 107% 1 
Strong 1088 107t5 Gluons 0 0 1 


and almost no mass (just spin), which are not of much concern to 
us, but the transmutation of a proton into a neutron increases the 
atomic number by one, thereby changing the identity of the atom, 
while leaving the mass virtually unchanged. This type of decay is 
controlled by a far less powerful short-range force called the ‘weak 
nuclear force’. 

In the nineteenth century, the experimental and theoretical work 
of Faraday and Maxwell showed that electricity and magnetism were 
intimately linked: a moving charge generated a magnetic field and 
a varying magnetic field induced current to flow in an electrical con- 
ductor. As such, electricity and magnetism were recognized as just 
being different manifestations of the same underlying phenomenon: 
electromagnetism. It has been a central goal of physics, since the 
late twentieth century, to understand the four forces discussed above 
in a similarly unified formalism. While some progress has been 
made, with the electroweak theory of Glashow, Salam and Weinberg 
in the 1960s combining electromagnetism and the weak interaction 
into a single force under suitably high energy conditions, a lot is still 
based on speculative ideas. Within the framework of these ‘grand 
unified theories’, or GUTs, force is seen as being mediated between 
the fermions that make up the building blocks of matter by ‘virtual’ 
bosons; these are particles (with integer spin) which come in and 
out of existence fleetingly, but in line with Heisenberg’s uncertainty 
principle in quantum mechanics. 


1.4.3 Probing matter by scattering particles 


Having reviewed the basic structure of atoms and the fundamen- 
tal forces, we can start thinking about how a beam of particles will 
interact with a sample when being scattered. The principal mech- 
anism will depend on whether the incoming probe has a charge, if 
it has spin, the incident energy and so on. An electron, for exam- 
ple, will be electrostatically repelled by the orbital electrons of the 
atoms, thereby preventing it from penetrating the sample by more 
than a small fraction of a micrometre; as such, electron scattering is 
useful only for studying the surface layers of materials. 
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An X-ray photon may not have a charge but, as an electromagnetic 
wave, it does consist of oscillating electric and magnetic fields. The 
charge of the orbital electrons interacts with these fluctuating fields, 
but not as strongly as with a particle having a fixed charge. X-rays 
penetrate matter more easily than electrons, therefore, and are able 
to pass through a few millimetres of aluminium. 

Neutrons are neutral particles that are unaffected by the charge 
of the orbital electrons. They interact with the nucleus of the atom 
via the strong nuclear force. Although inherently powerful, it acts 
only over very short distances which results in weak scattering. 
This offers the advantage of deep penetration (e.g. centimetres of 
aluminium), which enables the bulk properties of a material to be 
probed, but has the drawback of a poor scattering signal when a 
large sample is not available. If some of the atoms in the material 
under study are magnetic, by virtue of having unpaired orbital elec- 
trons, then the incident neutrons can be scattered by them through 
a spin-based dipole—dipole interaction. This mechanism can also 
operate in electron and X-ray scattering, but tends to be negligible 
compared to those mentioned above; with neutrons, however, nu- 
clear and magnetic scattering occurs at a similar rate. This has 
made neutrons the dominant probe for studying magnetism at the 
atomic level, but the shear X-ray intensity of the new synchrotron 
sources is starting to make them a viable alternative. 
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Fig. 1.11 A schematic illustration, following Pynn (1990), of the principal interac- 
tions by which electrons, X-rays and neutrons are scattered by condensed matter. 
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1.5 Energy, length and temperature scales 


We began this introductory chapter by discussing length scales. Let’s 
conclude by linking them with some corresponding measures of en- 
ergy and temperature that are in common usage. 

The central connection between length and energy comes from the 
original postulates of Planck and de Broglie, given in eqns (1.12) 
and (1.13), that were put forward to explain quantum phenomena. 
Starting with the former, a photon of frequency v is associated with 
energy E through E =hv where h is Planck’s constant. Hence, X- 
rays can be characterized equally well by their energy as by their 
frequency or, through 


c 
— 1.16 
7 ( ) 
where c is the speed of light, by their wavelength à. Since the ener- 
c = 2.998x10°ms * gies involved are tiny in the SI unit of joules, they are often stated in 
terms of the equivalent kinetic energy that an electron (with charge 
e) would gain when accelerated through a potential difference of V 
volts; 1 eV, or an electron-volt, is therefore about 1.6 x107+° J. These 
expressions for the energy yield the relationships 
h 
P= sirs. (1.17) 
This enables an X-ray photon with a wavelength comparable to atomic 
spacing to be specified in several alternative ways: 


A=1A <= v=3x10°THz <=> E=12.4keV, 


and it is just a matter of context, or subject convention, as to which 
one is preferred. 

Whereas it’s easy to think of X-rays as waves, with their electric 
and magnetic fields oscillating at frequency v, the same is not true 
of neutrons: they are one of the building blocks of atoms. It is more 
natural to consider them in terms of their particle properties, such 
as mass my and speed v. A connection with a length scale can then 
be made through eqn (1.13) by using p = mnv, the classical mechan- 
ics formula for the magnitude of the momentum: 


h 


Mpu ` 


A= 


A link to energy follows readily from equating E with the the kinetic 

energy of the neutron: 

2o h? 
2M A? 

This means that a neutron with a wavelength comparable to atomic 

distances can be specified equally well by 


E imav (1.18) 


A=1Å or v=3.96kms7™! or E= 81.8 meV. 


1.5 Energy, length and temperature scales 17 


We should note that chemists, especially spectroscopists, tend to use 
an alternative measure of energy called ‘wavenumbers’, 


Energy equivalent in wavenumbers = l (1.19) 

which corresponds to the reciprocal of the wavelength of a photon 
having energy E. The standard units are cm~!, and are interpreted 1meV = 8.066cm * 
as being the number of wavelengths of light that would occupy a 
distance of 1cm. 

Another useful measure of energy that is appropriate for particles 
such as neutrons is the temperature T: their average speed (v) is 
higher when they are in thermal equilibrium with something hot 
than when they are cold. In fact, an important result from thermo- 
dynamics, called the equipartition of energy, shows that each ‘de- 
gree of freedom’ (or movement) is associated with a mean energy of 
i ksT, where temperature is measured in degrees Kelvin (K) and TK =T°C +273 
k,=1.38x10~22 JKT! is known as Boltzmann’s constant. An atom of 
a monatomic gas, for example, which has three degrees of freedom, 
as it can (only) drift in the x, y and z directions, has an average 
kinetic energy of (3/2) ksT: 


(E= imu = shel. 


For neutrons, it has become conventional to equate the energy and 
temperature through the simplified relationship 


E = kpl; (1.20) 
so that room temperature, T = 293K, corresponds to 
A}=1.80A or v=2.20kms7! or E= 25.2 meV , 


or, for the spectroscopists, 204 cm~! in wavenumbers. 


> va = 2kpT/m 


(vy? = = kpT/m 
lv’) = 3kpT/m 


Fig. 1.12 The Maxwell—Boltzmann distribution of the frequency f of the speeds v of 
the atoms of a gas with mass m in thermal equilibrium at temperature T. It has a 
maximum at Umax, an average value of (v) and a root-mean-square of vas = (v”). 
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1.6 A table of useful constants 


Throughout this chapter we have made use of certain physical quan- 
tities, such as the mass of the neutron, the charge of the electron, 
Planck’s constant and so on. We now list these, and other constants 
that will be encountered in this book, together in one place to pro- 
vide a helpful reference. 


Quantity Symbol Value (SI units) 
Speed of light in a vacuum c 2.997925 x108 ms7! 
Planck constant 6.626069 x 10734 Js 
Planck constant (h/27) h 1.054562 x 10-34 Js 
Boltzmann constant ka 1.380651 x10723 JK 
Avogadro constant Na 6.022142 x 1073 mol! 
Gas constant R 8.31447 JK} mol"! 
Constant of gravitation G 6.6742 x 1071! Nm? kg~? 
Permittivity of free space Eo 8.854189 x 10712? Fm! 
Permeability of free space Lo 1.256637 x107 Hm! 
Fine structure constant Q 7.297353 x 1073 
Elementary charge e 1,602177 x10 © 
Mass of an electron Me 9.109383 x 10731 kg 
Mass of a proton Mp 1.672622 2104" kg 
Mass of a neutron Mn 1.674927 x 107?" kg 
Classical radius of electron Te 2.817938 x10715 m 
Bohr radius of atom ao 5.291771 x107!!! m 
Bohr magneton TA 9.274009 x107 JT! 
Nuclear magneton Un 5.050783 x107?" JT! 


Waves, complex numbers and 
Fourier transforms 


The theory of X-ray and neutron scattering relies heavily on the 
mathematics of waves. This chapter provides a tutorial introduction 
to the basic physical concepts, and the associated analytical tools, 
needed for an understanding of wave phenomena. 


2.1 Sinusoidal waves 


An everyday description of a wave would be a ‘wiggle’, or something 
that goes up-and-down as you move forward. The progression of the 
fluctuations could refer to changes in ‘height’ with respect to position 
at a fixed time, or with respect to time at a fixed position. Several 
examples of geometrical waves are shown in Fig. 2.1; they are un- 
usual in that they have points where there are abrupt changes in 
the value of the function or its gradient. What they have in common 
with the more familiar sinusoidal variation of Fig. 2.2 is a regularly 
repeating pattern. 

The sine and cosine curves of Fig. 2.2 are regarded as the archety- 
pal waves, as they occur in many elementary physical situations; for 
example, the vibrations of an elastic string. Their smooth character- 
istics also make them amenable to analytical manipulation. An easy 
way of visualizing sinusoidal variations is to think about the projec- 
tion of a circular motion onto the horizontal and vertical axes, as 
illustrated in Fig. 2.3. 


Y 


Fig. 2.2 The sinusoidal curves, or waves, 7 = A sin and w = A cos 8. 
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Fig. 2.1 Geometric examples of waves: 
square, triangular and exponential. 


Fig. 2.3 The generation of sinusoidal 
variations through circular motion. 
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The two curves shown in Fig. 2.2 are identical apart from a lateral 
shift of 7/2 radians, or 90°: cos@ = sin (6 + 7/2). Hence, the general 
expression for a function of this type is 


p = Asin(f +o), (2.1) 


where A is the amplitude of the wave and the angle ¢, or the phase, 
controls its horizontal displacement with respect to sin 6. If the @ in 
eqn (2.1) varies linearly with position xz, so that 6 = kx where kisa 
constant, then we obtain a sinusoidal variation with respect to this 
physical coordinate: 


w=Asin(kx+¢). (2.2) 


Since the sine curve cycles around every 27 radians, the correspond- 
ing repeat distance, or wavelength A, can be found from 


= 2T 


k 
A 


(2.3) 


This is called the wavenumber and has SI units of radm~'. Note 
that, as mentioned in Section 1.5, spectroscopists use the same term 
for 1/A given in cm}. 

If the ¢ in eqn (2.2) itself varies linearly with time t, so that it can 
be written as ¢ = o —wt where ġo and w are constants, then we 


obtain the travelling wave 
p = Asin(kr—wt+ ġo). (2.4) 


That is to say, with w > 0, the sinusoidal variation in x moves steadily 
towards the right as time evolves; this is illustrated in Fig. 2.4. The 
crests and troughs of the translated wave will coincide with those of 
an earlier time after a duration T, called the period, given by 


w = — |. (2.5) 


Fig. 2.4 The travelling wave of eqn (2.4) plotted as a function of x for several values 
of t, from zero to a quarter of the period. 


The reciprocal of T, usually denoted by v, is known as the frequency 
of the wave. It is related to its angular variant, w, through 


w = 2rV, (2.6) 


with w specified in rads~! and v in cycles per second or Hz (hertz). 
The speed of the wave, c, follows readily from the observation that 
it moves forward by a distance \ in a time T: 


c== — = vÀ, (2.7) 


in agreement with the result quoted in eqn (1.16). 


2.1.1 The direction of propagation 


A negative prefactor was chosen for the wt term in eqn (2.4) so that 
the wave would travel in the positive x direction; the opposite sign, 
w=Asin(kaz +wt + ġo), gives a wave that moves backwards. In fact, 
a reversal also occurs with y% = Asin(—kgæ — wt + ġo). Is there any 
reason for preferring one of these alternatives over the other to de- 
fine the sense of the progression? 

Conceptually, it would make more sense to associate the change of 
sign with the spatial term, rather than the temporal factor, because 
we are concerned with an orientation. This line of thought leads 
to the following generalization to accommodate fully the directional 
aspect of waves: 


p = Asin(ker—wt+ do), (2.8) 


where the bold script k and r are vectors, and the dot between them 
indicates their ‘scalar multiplication’. The vector r denotes a general 
position in space, with coordinates x, y and z, but what do the three 
components, kr, ky and kz, of the wavevector k represent? Its mag- 
nitude, or modulus, |k|=k is the familiar wavenumber of eqn (2.3), 
and its orientation indicates the direction of propagation. For a wave 
travelling along the x direction, with k, = k, =0, the scalar product 
k.er=k,x where k,=k for a forwards progression and k,=—k for 
the reverse. 

Since r and k are generally three-dimensional vectors, the wave 
of eqn (2.8) tends to be a function of x, y, z and t. As such, it rep- 
resents a travelling ‘plane wave’ rather than a moving oscillation 
on a string. That is to say Yy, which could be the air pressure in a 
sound wave, is uniform in planes perpendicular to k, but its value 
varies sinusoidally with time in the direction of the wavevector in 
accordance with the wavelength of eqn (2.3), the period of eqn (2.5) 
and the speed in eqn (2.7). The situation is illustrated for the two- 
dimensional analogue in Fig. 2.5. 
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Fig. 2.5 The geometry of a plane wave. 
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axb b 
G) 
bxa 


laxb| = |al |b] sin é 
aeb = bea 
axb = —-bxa 


Magnitudes, directions and vectors 


Quantities that have both a magnitude and a direction, such as a force, 
are called vectors. Unlike scalars, which only have a ‘size’, they cannot 
be quantified by a single number. They are defined by coordinates, or an 
array of numbers giving displacements with respect to a set of reference or 
basis axes. In the most common case of an x, y and z or Cartesian system, 
the vectors a and b can be written as 


a = (0002) aml D = (Or; Dyz) = 


Addition and subtraction are straightforward, in that the corresponding 
components are just combined separately: 


a+b = (as + bz, dytby,az+bz) , 


with all the pluses replaced by minuses for a take away. The multiplica- 
tion of a vector by a scalar, u say, is also easy, 


pa = (Uae, fay, Laz) , 


and yields a vector with the original direction but an appropriately scaled 
length. The modulus, magnitude or length of a vector is given by Pythago- 
ras’ theorem; it’s one for a unit or normalized vector. 
A vector can be multiplied by another in two different ways. The first is 
a ‘dot’ or scalar product, which is a sum of the products of corresponding 
elements: 
aeb = azrbr + dyby + azbz (2.9) 


and is geometrically the modulus of a times the modulus of b times the 
cosine of the angle between them. Vectors of non-zero length are perpen- 
dicular, or orthogonal, to each other if their dot product is zero; if they are 
also of unit length, they are said to be orthonormal. 

Vectors can also be multiplied by a ‘cross’ or vector product. This is a bit 
more complicated since the result is a vector: 


axb = (ayb:— 0, ay, 020, — 0,0. Ta Oy OnT) - (2.10) 


Geometrically, its magnitude is equal to the modulus of a times the mod- 
ulus of b times the sine of the angle between them; this is also the area of 
the related parallelogram. The direction of the cross product is perpendic- 
ular to both a and b, and given by the ‘right-hand screw rule’: if the curl 
of the right-hand fingers indicates the sense of rotation needed to go from 
a to b, then the direction is given by the out-stretched thumb. 

The physical meaning of a dot product holds irrespective of the dimen- 
sionality of the vectors and eqn (2.9) generalizes in an obvious way. The 
same is not true of a cross product, which is specific to a space of three di- 
mensions (as considered here). The scalar product is also symmetric with 
respect to an interchange of a and b whereas the vector product is anti- 
symmetric: the latter changes sign but the former does not. Division by a 
vector is not defined and must never be performed. 


2.1.2 The principle of superposition 


A central feature of waves is that they pass through each other un- 
affected and, where overlapped, give a net result that is the sum of 
the individual contributions. This principle of superposition lies at 
the heart of scattering theory. Here we illustrate it with a couple 
of one-dimensional examples involving the combination of just two 
sinusoidal waves. 

Consider first the case where the waves are identical but travel- 
ling in opposite directions. With the simplifying assignments that 
A=1and ¢, =0 in eqn (2.4), to reduce the algebraic clutter, the 
principle of superposition yields 


w = sin(kx —wt) + sin(—kr-—wt) 
= —2 sin(wt) cos(k x), 


where the second line follows from a trigonometric ‘factor formula’ 
and the antisymmetric properties of the sine function. This is called 
a stationary wave (Fig. 2.6), because there is no movement with time 
along the x direction. The separation of 7 into a product of spatial 
and temporal terms results in a purely ‘up-and-down’ oscillation, at 
a frequency of w, with an amplitude that varies sinusoidally with 
wavelength \ = 27/k. The locations at which the amplitude is zero 
are called nodes. 

As a second example, consider two waves travelling in the same 
direction with equal amplitudes but slightly different wavelengths 
and frequencies: k + Ak and w + Aw, where the A-terms represent 
small departures from the average k and w. With the simplification 
that A=1 and ¢,=0, as before, Yy is now a product of two travelling 
waves: 


V = sin([h+AK] a — [w+Ault) + sin([k-Ak]a—[w— Ault) 
= —2sin(kz—wt) cos(Aka2—Awt) , 
Y 
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Fig. 2.7 The slowly varying ‘beating’ modulation, of wavelength 27/Ak, propagates 
with a speed of Aw/Ak, whereas the finer structure inside the envelope has the 
properties of the average wavelength and frequency, w and k. 
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sin X+sin Y = 2 sin(=£*) cos (455) 


Fig. 2.6 A stationary wave, plotted as 
a function of x for several values of t. 
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and is illustrated in Fig. 2.7. The amplitude of a sinusoid with the 
mean wavelength of 2r /k, propagating with a speed w/k, is mod- 
ulated by a slowly varying envelope of wavelength 27/Ak, moving 
with a speed of Aw/Ak. This is the origin of the beating that is 
heard when neighbouring musical notes are played together: the 
sound becomes periodically louder and quieter. 

Although we have only considered the combination of two simi- 
lar waves, its generalization to the sum of many such components 
results in the formation of wavepackets; the beating modulation of 
Fig. 2.7 is just the most elementary example. The shape of the 
wavepacket will be preserved on propagation if all its constituents 
travel with the same speed c, when 

Ww dw 


p and F 
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The relationship between two quantities, x and y say, can be visualized 
with the aid of a graph. While the intersections of the associated curve 
with the xz and y axes may be of interest, it is often more important to 
know the slope at any given point; that is, how quickly y increases, or de- 
creases, as x changes, and vice versa. This issue is at the heart of the topic 
of differentiation, and the related rules and formulae are simply ways of 
calculating the gradient algebraically. 
Let us begin with a precise definition of what is meant by the slope of 
a curve. Suppose that y is related to x through some function called f’, 
usually written as y = f(x), so that f(x) = ma+c for a general straight 
line, and f(x) = sin(x) for a sinusoidal variation, and so on. Then, if the 
horizontal coordinate changes from x to x+ Azv, where Ax represents a 
small increment, the value of y is altered from f(x) to f(x+Az). The 
gradient, at a point x, is defined to be the ratio of the change in the ver- 
tical coordinate, Ay, to that of the horizontal increment, as Ax becomes 
vanishingly small. This can be stated formally as 
dy ay f(a + Ax) — f(x) 
dz on een Ar = am AT í Ty) 
where dy/dz is known as the derivative, or differential coefficient, and 
is pronounced ‘dy-by-dx’. The tendency of Az — 0 has to be approached 
gradually to ascertain the limiting value of the ratio Ay/Az, as both in- 
crements are individually equal to zero when the condition is met. Strictly 
d speaking, we should check that the same value of dy/dz is obtained 
= FPD =i z) whether Az is positive or negative, but this is assured as long as the 
curve y = f(x) is ‘smooth’; inconsistencies will arise if kinks and sudden 
dy tju breaks (or discontinuities) are present, and the function is said to be non- 
Y s2 ==> ($4) = f(x) differentiable at those points. 


If the speed of the sinusoidal waves varies with their wavelength, 
because the frequency does not happen to be directly proportional 
to the wavenumber in the medium of interest, then the wavepacket 
will change with time. This phenomenon is called dispersion, and 
the relationship between w and k which determines the nature of 
the ‘spreading’, 

w= wk), 


is called the ‘dispersion relation’ or the ‘dispersion curve’ (Fig. 2.8). 
For the non-dispersive case, w =c k, there is a unique speed, c, as- 
sociated with the propagation of a wavepacket. The ratio w/k and 
the derivative dw/dk still yield useful characteristic speeds, how- 
ever, when there is a dominant contribution from sinusoidal waves 
around a particular wavelength: 
Ww _ dw 

ve =F and Ug = dk’ 
where vg is called the phase velocity, and gives the rate at which the 
crests and troughs of the local wavefront move, and vg is the group 
velocity, which indicates how fast the envelope of the wavepacket 
travels. 


(2.12) 


2.2 Complex numbers 


The analysis of wave phenomena is aided greatly by the use of com- 
plex numbers. In particular, by a result which links an exponential 
to sines and cosines: 


e = cos +isind | , (2.13) 


where i? = —1. Since complex numbers play a central role in theo- 
retical work, we will devote a few pages to them; as with most of the 
mathematical background given in this book, the material is based 
on Sivia and Rawlings (1999). 


2.2.1 Definition 


If any number, integer or fraction, positive or negative, is multiplied 
by itself, then the result is always greater than, or equal, to zero. 
What, then, is the square root of —9? To address this question we 
need to invent an imaginary number, usually denoted by ‘i’, whose 
square is defined to be negative: 


CS. (2.14) 


A real number, say b (where b? > 0), times i is also imaginary; it’s 
just b times bigger than i. If a is also an ordinary number, then the 
sum z of a and ib, 

z =a+ib, (2.15) 
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O 


Fig. 2.8 A dispersion curve, and the 
related phase and group velocities for 
waves in the neighbourhood of ko. 
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1421+ 6+-) = 443i 


(496 =i) = 545i 


is known as a ‘complex’ number; this does not indicate an intrinsic 
difficulty with the concept, but highlights the hybrid nature of the 
entity. It consists of both a real part and an imaginary one: 


Reiz} =a and Zm{z}=b. (2.16) 


It may seem odd that Zm {z} is b rather than ib, but this is because 
it represents the size of the imaginary component. 


2.2.2 Basic algebra 


To add or subtract complex numbers, we simply add or subtract the 
real and imaginary parts separately: 


a+ib +(c+id) =a+tc+i(b+d), (2.17) 


where a, b, c and d are real. The usual rules of algebra apply for 
brackets and multiplication, except that every occurrence of i” is 
replaced by —1. Thus, it’s easy to show that the product of a + ib 
and c+id is given by 


(a+ib)(c+id) = ac—bd+i(ad+bc), (2.18) 


since ibd = —bd. Division involves the use of a complex conjugate, 
so let us consider this first. 

The conjugate of a complex number z, denoted by z*, is defined to 
have the same real part but the opposite imaginary component; that 
is, Re{z*} = Re{z} and Im {z*} = —Im {z}. In terms of eqn (2.15), 
therefore, 

z* = (a+ib)* = a — ib. (2.19) 


Hence, complex numbers and their conjugates satisfy the following 
relationships: 


z+z* = 2a = 2Re{z} 
z=z* = 216 = 2ilm{4z} (2.20) 
z2* = a+b? = zl? 


We will come to the meaning of |z| shortly, but the important point 
about eqn (2.20) is that the product z z* is a real number. This fea- 
ture enables us to calculate the real and imaginary part of the ratio 
of two complex numbers by multiplying both the top and bottom by 
the conjugate of the denominator 

a+ib a+ib  c—id ac+bd+i(bc—ad) 


_ — eee 2.21 
ad ck c=- EFF (2.241) 


To evaluate the ratio (1+21i)/(3—i), for example, we multiply it by 
unity in the form (3+1)/(3+1); this gives a real denominator of 10, 
and a complex numerator of 1+7i. Hence the result is 1/10 +17/10. 


2.2.3 The Argand diagram 


So far we have considered complex numbers from an algebraic point 
of view; it is often helpful to think of them in geometrical terms. 
This is easily done with the aid of an Argand diagram where the 
horizontal, or x, axis of a graph is seen as representing the real part 
of a complex number, and the vertical, or y, axis gives the imaginary 
component. Thus the point with (x, y) coordinates (a, b) corresponds 
to the complex number a + ib. Its conjugate z* is a reflection in the 
real axis. An alternative way of specifying the location of a point ona 
graphis through its distance r from the origin, and the anticlockwise 
angle 0 that this ‘radius’ makes with the (positive) real axis. In this 
system r is known as the modulus, magnitude or amplitude of z; 0 
is called its argument or phase. 

The quantities r, 0, a and b in the Argand diagram are related 
through elementary trigonometry by 


a = rcosů and b=rsiné, (2.22) 
or, in the reverse sense, by 


m= +0 and @ = tan b/a) (2.23) 
A comparison between eqns (2.20) and (2.23) shows that zz* =r?, 
where r = |z| is the modulus of the complex number. The second 
part of eqn (2.23) needs qualification because there is an ambiguity 
of 180° with tan~' (b/a). The arctangent of unity, for example, could 
be either 45° or —135°. For complete consistency with eqn (2.22), 
0<0<7 (radians) for b>0 and —7<@<0O for b<0; ô is zero for a>0, 
and +7 for a<0, if b=0. It is also worth remembering that @ is only 
defined to within a factor of 27, because we could add (or subtract) 
any integer number of 360° to it and obtain the same point in the 
Argand diagram. 


2.2.4 The imaginary exponential 


Perhaps the most important result in complex analysis concerns the 
exponential of an imaginary number: 


e? = cos + isin 6, (2.24) 


where @ is in radians. This equation can be verified by substituting 
x =10 in the Taylor series expansion for e”, and collecting the odd 
and even powers of 0 separately; remembering that i” = —1, a com- 
parison with the Taylor series for sin and cos@ yields eqn (2.24). 
The product of r and e? allows a complex number to be expressed 
in a very compact form in terms of its modulus and argument: 


z = a+ib = r (cos + isin) = ro’, (2.25) 
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where a, b, r and @ are related through eqns (2.22) and (2.23). As 
can be seen from the Argand diagram, and verified by the symmetry 
properties of sines and cosines, its conjugate entails the replacement 
of 0 with — 90: 


z* = a—ib = r (cos0 — isinb) = re’, (2.26) 
from which the result z z*= r? follows immediately. 

Although the exponential form of a complex number is very useful 
when dealing with roots and logarithms, and provides a valuable in- 
sight into products and quotients, our interest here is in its relation- 
ship with waves. This hinges on eqn (2.24), which enables eqn (2.8) 
to be written as the imaginary part of 


w — Ap rmo, (2.27) 


where A is now a complex number whose modulus and argument 
give the amplitude and phase offset of the wave, respectively: 


A = |Alel?. (2.28) 


The real part of eqn (2.27) also represents the same wave, apart 
from a difference of 90° in the value of ġo. 


Taylor series 


When dealing with a complicated function, it can be useful to approxi- 
mate it with one of a simpler form. While the latter may not represent a 
complete and accurate description of the situation at hand, it frequently 
provides the only means of making analytical progress. There are many 
approximations that could be used, of course, but it is the one that cap- 
tures the salient features that is most helpful. A Taylor series is appropri- 
ate when our principal interest lies in the behaviour of a function in the 
neighbourhood of a particular point. 

Consider the curve y=f(x). The crudest approximation to this function 
is a horizontal line y= ao, where ao is a constant; if a) =f (xo), then it will 
even be correct at x= xo. A better approximation would be a sloping line 
Y = a + aı(£— zo), where the coefficient a; allows for a non-zero gradient. 
Continuing along this path, we could add a quadratic (or curvature) term 
ax(%—2a)”, a cubic contribution a3(a—2.)*, and so on, to gain further im- 
provements. Thus, a function f(z) can be approximated about the point 
zo by using a polynomial expansion: 


f(x) © ao + as(a—20) + Go(a@—20)" + a3(a—20)? +- °°. (2.29) 


This is the essence of a Taylor series. Its advantage is that the right-hand 
side of eqn (2.29) is usually easier to calculate, differentiate, integrate, 
and generally manipulate, than the expression on the left. The case of 
Xo =0, when the Taylor series simplifies, is called a Maclaurin series. 


The benefit of using eqn (2.27) over (2.8) in wave analysis is that 
exponentials are easier to deal with mathematically than sinusoids; 
multiplication, differentiation and integration, for example, are more 
straightforward. As an illustration of this advantage, let’s derive the 
‘compound angle’ formulae for sines and cosines with complex num- 
bers. Starting with the rule of eqn (1.2) for combining powers, 


el (a+) — ela eib. 
and expanding the exponentials with eqn (2.24), 
cos (a+) +isin(a+Z) = (cosa +isina) (cos 8 + isin 8), 


the equating of the real and imaginary parts on the left- and right- 
hand sides yields the desired results: 


cos (a+8) = cosa cos — sina sinf, (2.30) 
sin(a+Z) = sina cosZ + cosa sin’. (2.31) 


As well as being the real and imaginary parts of exp(i@), sines and 
cosines can also be expressed as 
10 —10 iĝ _ 10 
cos = din i and sinf = iit f (2.32) 
2 21 
which follow from the addition and subtraction, respectively, of eqn 
(2.24) with its complex conjugate. 


2.3 Fourier series 


Let’s begin wave analysis by considering how periodic signals, such 
as those in Fig. 2.1, can be decomposed into the sum of sinusoids. 
Suppose that the function f(x) repeats itself after a ‘distance’ of A, 
so that 

f(x) = f(x +A). (2.33) 


This has the same periodicity as sines and cosines of wavenumber 
k = 27/X. A simple approximation to f(x), which matches its wave- 
length, is therefore 


f(x) © a + a cos(kx) + bisin(kx), (2.34) 


where a, a, and b, are constants whose values need to be selected 
in some way. The crudest assignment would be to set both a, and b, 
equal to zero, giving an invariant f(x) ~ a, but the linear combina- 
tion of sin(kx) and cos(kx) allows for a sinusoidal variation with the 
correct period and an appropriate amplitude and phase: 


acos(kx) + bsin(kx) = Asin(ka+@) , 


where a= Asin@g and b= Acos ọ, in accordance with eqn (2.31). 
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sin(@) = — sin(—6) 


cos(0) = 


cos(—6@) 


The sines and cosines of 2 kz, 3kx, 4kx, and so on, also satisfy the 
periodicity of eqn (2.33); they just go through several, or many, com- 
plete cycles in the interval A. We can obtain a better approximation 
to f(x), therefore, by including contributions from these higher-order 
terms: 


f(a) = ao + a,cos(kz) + a,cos(2kz) + acos(3kz)+ +- 


2.35 
+ b sin(kz) + b sin(2 kx) + 6b, sin(3kx) + --- ( ) 


This expansion is called a Fourier series, and eqn (2.34) is simply 
the first-order version of it which contains only the lowest, or funda- 
mental, harmonic. 

We will come to the evaluation of the coefficients a,, and bn, for 
integer n, shortly but note that one of the sets goes to zero if f(x) 
possesses a symmetry about the y-axis: 


f(-r) => b,=0 


f(a) = (2.36) 


f(-x) = an=0 


because sines and cosines are odd and even functions, respectively. 
The generalization of eqn (2.35) explains why the invariant term is 
designated as a, and why there is no corresponding b, (apart from 
its general redundancy): they are the coefficients of cos(0)=1 and 
sin(0)= 0, with the b, being unnecessary since it adds nothing to the 
Fourier series. 


2.3.1 Orthogonality and the Fourier coefficients 


A prescription for the a, and bn in eqn (2.35) presents itself once 
we realize that the related sine and cosine functions are orthogonal. 
By this we mean that the integral of the product of any two over the 
interval of the period à will be zero, unless they happen to be exactly 
the same functions: 


; 0 it mÆn, 
J sintra) sin(nkz)dzr = N (2.37) 
0 2 


fm=n, 


with an identical expression for cos(m kx) cos(nkax), but n#0, and 
x 
fsm kx) cos(nkx) dx = 0. (2.38) 
0 


Although these sines and cosines aren’t perpendicular in a geomet- 
rical sense, this type of integral is the functional analogue of a dot 
product which is zero for orthogonal vectors. 

If we multiply eqn (2.35) through by one of the sine or cosine func- 
tions, sin(mka) or cos(mka), and integrate the resultant products 
over the period à, then all but one of the terms on the right-hand 


side will be zero due to eqns (2.37) and (2.38). The surviving m=n 
contributions yield the formulae for the Fourier coefficients: 


A A 
én = 3 ft) cos(nkx)dz and bn = 3 ft) sin(nkx)dz (2.39) 
0 0 


for n =1,2,3,..., from which eqn (2.36) can be verified. If eqn (2.35) 
is integrated over the period à as it stands, then the constant a, is 
seen to be the average value of f(x): 


d 
[ro dz . 
0 


Cumulative properties and integrals 


(2.40) 


Ao = 


>= 


While differentiation is concerned with the slope of y = f(x), integration 
deals with the ‘area under the curve’. This relates to the average and 
cumulative behaviour of y, over some range in z. 

To set up a definition of an integral, consider the region bounded by the 
straight lines x =a, x =b and y= 0, and the curve y = f(x). The size 
of the enclosure can be estimated by approximating it as a whole series of 
narrow vertical strips, and adding together the areas of these contiguous 
rectangular blocks. If the x-axis between a and b is divided into N equal 
intervals, then the width of each strip is given by Ax = (b—a)/N; the 
corresponding heights of the thin blocks are equal to the values of the 
function f(x) at their central positions. In other words, the area of the j+” 
strip, which is at x= zx; and of height y = f(x;), is f(x;) Az; the index j 
ranges from 1 to N, of course, with zı =a +Ax/2 and zy=b—Az/2. As 
N tends to infinity, Az —0 and the approximation to the area under the 
curve becomes ever more accurate. This limiting form of the summation 
procedure defines an integral 


b b 


N 
[vac = [to ae = ee GS 


a a 


(2.41) 


where the symbol fdz is read as the ‘integral, from a to b, with respect 
to x’. The use of the term ‘area’ in the above discussion needs some qual- 
ification, in that it can be negative; this is because the ‘height’ of a strip 
f(xj)<0 whenever the curve y = f(x) lies below the x-axis (and even the 
width Ax <0 if 6 <a). 

Although an integral is defined as the limiting form of a summation, it 
is usually calculated analytically by noting that ‘integration is the reverse 
of differentiation’. While this may not be obvious, it is easily illustrated 
with an example from everyday kinematics: the distance travelled by a 
car (say) is the integral of the speed with respect to time, and speed is the 
rate of change of distance with time (a derivative). 
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A 
[sin(nka) dz = 0 
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2.3.2 The complex Fourier series 


The Fourier series of eqn (2.35) can be written in a very compact 
form by using complex numbers: 


f(x) = ` Cn ane (2.42) 


n =—CoO 


where the © stands for a summation over integer values of n, from 
—oo to co, and the approximation has been replaced by an equality 
to denote a definition. The right-hand side of eqn (2.42) will yield a 
real function, f(x), as long as the complex coefficients c,, satisfy the 
conjugacy condition 

C4 "6, (2.43) 


This follows from eqn (2.20) because the contribution to the sum 
from pairs of positive and negative values of n of equal magnitudes 
will then be 


Cn einka Ae Guy Tinke = J Re S einke | 
= a, cos(nkz) + by sin(nka) , 


for n#0, where we have substituted 2c, = a,—ib, in the second 
line to obtain consistency with eqn (2.35). In fact, the formula for 
the complex coefficients is simply 


A 
Gn = ifto) g nea dy (2.44) 
0 
with co = Hp: 


2.4 Fourier transforms 


We began our discussion of Fourier series by considering how a peri- 
odic function could be decomposed into, or approximated by, a sum of 
sinusoidal waves. The analysis can be extended to the non-periodic 
case by letting \ — œo, so that no repetitions are required of f(x) 
within a finite interval. To carry out this limiting procedure, it is 
helpful to define 


_ an 


A 


because the wavenumber of the fundamental harmonic, Ak, shrinks 
gradually to zero as \ gets ever larger and kn approaches a contin- 
uum even with integer n. The imaginary exponentials of eqns (2.42) 
and (2.44) can then be written as 


Ak and kn = nAk 


and e , 


and the coefficients expressed as 
Cn = Qa F(kn) Ak, 


where a is a constant and F(k) is a continuous function of k. With 
these substitutions, eqns (2.42) and (2.44) become 


oo à/2 
f(x) =a F(kn)dè™ Ak and F(kn)= zi | fle) de, 
Poo —\/2 


where we have expressed the integral over a period as being from 
—/2 to A/2, instead of 0 to \, for a more symmetrical appearance. 
The limit of \—+co, when Ak—0, can now be taken safely, and yields 
the integrals 


(2.45) 


and 


(2.46) 


as being the continuum versions of eqns (2.42) and (2.44), where 
we have set a =1/v2r for aesthetic reasons of symmetry. These 
come as a linked pair, and define a Fourier transform and its inverse; 
which one is called which is quite arbitrary. 

While the exponents of a Fourier transform and its inverse must 
have opposite signs, their precise definitions are a matter of conven- 
tion; the choice of a is up to us, for example. If the wavenumber is 
taken to be 1/\ instead of 27/\, as done by spectroscopists, then the 
exponents will be +i2zkx and neither integral will have a scaling 
term: 


f(x) = J F(k) e?"**dk and F(k)= I Hae adp 


where k is measured in cycles per unit length, typically cm~!, rather 
than the SI radians per metre. 

Although our goal is to gain a physical insight into Fourier trans- 
forms, we first need to discuss some of their formal properties. Basic 
symmetries are a good place to start, as the most common one is the 
continuum analogue of eqn (2.43): 


f(x) = f(2)* <=> F(—-k) = F(k)*, (2.47) 
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g(x) 


which states that the Fourier transform of a real function is ‘con- 
jugate symmetric’. If one of them possesses a symmetry about the 
origin, then so too will the other: 


f(x) = ee > F(k) = Eri (2.48) 


Equations (2.47) and (2.48) can be combined to show that the Fourier 
transform of a real and symmetric function is also real and even, 
whereas that of a real and antisymmetric function is imaginary and 
odd; this is equivalent to eqn (2.36). 

The substitution of k=0 in eqns (2.46) and (2.47) reveals F(0) to 
be proportional to the area under the curve y=f(x), 


F(0) = Az J t) dz, (2.49) 


and necessarily real if f(x) =f(x)*. It will equal zero if f(x) =-—f(—x). 
Technically, the integral of the modulus, |f(x)|, must be bounded (or 
finite) if its Fourier transform is to exist everywhere; this is known 
as the Dirichlet condition. 


2.4.1 Convolution theorem 


One of the most useful results in Fourier theory concerns the convo- 
lution of two functions. Mathematically, the convolution of g(x) and 
h(a) is defined by 


CO 


g(x) ® h(x) = [se h(a—t) dt, (2.50) 


=0p9 


where g&h is read as ‘g convolved with h’, and physically represents 
a ‘blurring’ of g(x) by h(x). This can be understood from the example 
of Fig. 2.9, where g(x) consists of four spikes, or 6-functions, and 
h(x) is a broad asymmetric function. The convolution is carried out 


h(x) 
A 


Fig. 2.9 The convolution of the spiky function g(x) with the broad asymmetric function h(x): f(x) = g(x) 8 h(x). 


by replacing each of the the sharp peaks in g(x) with scaled copies of 
h(x) and adding together the four contributions; those from the two 
closely spaced components in the middle, shown by dotted grey lines, 
combine to give a resultant function where the constituent doublet 
is no longer resolved clearly. Although it’s not as easy to visualize 
it the other way around, eqn (2.50) can equally be thought of as the 
blurring of h(x) by g(x). 

The convolution theorem states that the Fourier transform of the 
convolution of two functions is proportional to the product of their 
Fourier transforms: 


f(z) = glz) @h(z) <=> F(k) = Van G(k)xH(k) |, (2.51) 


where F(k), G(k) and H(k) are the Fourier transforms of f(x), g(x) 
and h(x), respectively, according to eqn (2.46). Given the reciprocity 
between a Fourier transform and its inverse, 
fz)= g) xhi) = Fk ore G(k) @ H(A). (2.52) 

The power of eqn (2.51) will be illustrated in a physical sense in the 
next section, and throughout this book, but its computational benefit 
stems from the fact that it’s much easier to multiply functions than 
to convolve them. To work out g(x)&h(x) numerically, for example, 
its quicker to use a fast Fourier transform (FFT) computer subrou- 
tine to calculate G(k) and H(k), and inverse Fourier transform their 
product, than to compute the integral of eqn (2.50) directly. 

Putting k =0 in eqn (2.51), and interpreting F(0), G(0) and H(0) 
with eqn (2.49), shows that the area under the convolution is equal 
to the product of the corresponding individual integrals: 


CO CO CO 


J [sœ enc) ax = [so dz x fo dt. (2.53) 


=09 =00 =09 


This can be seen from the example of Fig. 2.10, where an array of 
different shaped peaks is convolved with a Gaussian. Although the 


g(x) h(x) 
A A 


f(x) 
A 
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g(x) 8 h(x) = h(x) 8 g(x) 


g(x) @ h(x) = 


[oa H(k) ** dk 


Fig. 2.10 The convolution of a function with an array of different shaped peaks, g(x), with a Gaussian, h(x). 
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(x —zo) Q h(x) = h(x —zo) 


f(a)* f(a) = |f(x)|* > 0 


The Dirac 6-function 


A Dirac 6-function, ô(x— xo), is a sharp spike of unit area at a given loca- 
tion, £o; its simplicity as a ‘point impulse’ makes it a useful test object for 
studying equations that model physical situations. Mathematically, it is 
defined by 


0(2 25) = 0 i mra amal J 8-2.) dr = l, 


and can be thought of as the limiting form of a variety of functions as they 
become ever narrower. Of these the most straightforward is a rectangular 
column of width €, centred on x = zo, and height 1/e; this acquires the 
properties of ô(x— xo) in the limit of e — 0. An important corollary of the 
above definition is 


b 


io) M =< a <n 
[Po A =o) dz = (2.54) 


0 otherwise , 


a 


so that integrals involving a 6-function are easy to evaluate. 


two spikes on the left of g(x) merge into one in f(x), because they 
are very closely spaced compared with the width of h(x), the areas 
of the various components in the blurred output are proportional to 
those of the input signal. The amplitudes of the narrowest peaks are 
affected the most, since their relative spreading is the greatest as a 
result of the convolution; the slowly varying parts of the structure 
change the least. 


2.4.2 Auto-correlation function 


The last Fourier concept that we need to consider concerns the auto- 
correlation function, or ACF, which provides information on the dis- 
tance distribution of the various structures in f(z). Mathematically, 
the ACF of f(x) is defined by 


ACF(z) = J f(t)“ f(a+t) dt, (2.55) 
and is real if f(x)“ = f(x). Although this looks like a self-convolution, 


or f(x) @f(—2), it’s not the best way to think about eqn (2.55). The 
ACF is largest at the origin, 


CO 


ACF(0) = IKORO dt, 


=p 


Fig. 2.11 An f(x) consisting of four sharp peaks and its auto-correlation function. 
The spike at the origin of the ACF should be three times higher than drawn, and has 
been suppressed for clarity. The relationship of the closest and farthest peaks in f(z) 
to their corresponding mutual contributions in the ACF is indicated. 


because everything correlates with itself. The value of the ACF at 
a distance L away from the origin is calculated by multiplying f(x) 
with a copy that’s displaced by L relative to it, f(L +), and integrat- 
ing the product; its magnitude is a measure of how much structure 
there is in f(x) separated by a distance of L. This can be under- 
stood most easily by considering the ACF of a function that consists 
of a few sharp peaks, such as that shown in Fig. 2.11. Basically, two 
spikes at x, and x, in f(x), with amplitudes A, and A,, will contribute 
a symmetric pair of very sharp components at + (x,—,), and mag- 
nitude A, A, towards the ACF of f(x); they will also add an amount 
A?+ A? to the ACF at the origin. 

The reason for discussing the ACF is its linear relationship to the 
modulus of a Fourier transform: 


ACF(z) = I |F(k)| r? dk, (2.56) 


where F(k) is given by eqn (2.46). While a Fourier transform and its 
inverse contain the same information, albeit in different ways, and 
it’s possible to switch between one and the other through eqns (2.45) 
and (2.46), the situation becomes less straightforward if only |F(k)| 
is available. We can begin to appreciate the problems caused by such 
a loss of the Fourier phase by comparing the relative complexity of 
the ACF with f(x) in Fig. 2.11. The ACF, which is directly available 
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ACF(—a) = ACF(z)* 
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ACF 


Fig. 2.12 An f(x) containing a variety of peaks and its auto-correlation function. 


from |F(k)| through eqn (2.56), is much harder to interpret in terms 
of the underlying structure; for a diffuse case, such as that in Fig. 
2.12, it’s almost impossible. 


2.5 Fourier optics and physical insight 


So far, we have discussed Fourier transforms in a largely abstract 
context. Now let’s try to gain some physical insight into their proper- 
ties with the aid of diffraction experiments familiar from high school 
physics. First, though, we need to establish the link between optics 
and Fourier transforms. 

The geometry of the diffraction experiment is shown in Fig. 2.13, 
where a travelling plane wave passes through a set of slits and pro- 
duces a pattern of dark and light bands on a very distant screen. We 
have made the problem one-dimensional for simplicity, but will indi- 
cate its generalization later. The nature of the aperture is defined by 
the function A(x), which describes how much light passes through 
it at position z; this is called the aperture function. It usually only 
takes values of zero or one, corresponding to complete opaqueness 
and transparency respectively, but it could in principle be complex 
with 0 < |A(x)|< 1. 

To calculate the diffraction pattern, the principle of superposition 
tells us that we need to add up all the waves that emerge through 
the aperture. The amplitude of the contribution from the narrow 
region between x and z + Az is proportional to A(x) Az, but what 
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Fig. 2.13 The geometry for Fraunhofer diffraction by a one-dimensional aperture, 
A(x); the interference pattern of interest, I(q), is projected onto a distant screen. 


about its phase ¢? That depends on both xz and the angle of propaga- 
tion relative to the incident wave, 0, as well as the time t. The phase 
will be invariant with position parallel to the incoming wavefront, 
but will gain a relative factor of 


ho = (25) x sin 0 


in the direction of 0 due to the associated path difference of x sin 0. 
Hence, the complex contribution to the resultant wave is 


Ad = Yo A(z) À Ag, 


where g=27sin@/\ and the temporal variation has been absorbed 
into the ‘constant’ of proportionality, Vo. The diffracted wave, w, is 
the sum of all such terms; in the limit Az — 0, it becomes the Fourier 
transform of the aperture function: 


CO 


Yla) = Vo i} A(a) el? da. (2.57) 


— Co 


Thus we met Fourier transforms a (very) long time ago but did not 
realize it! Before reminding ourselves of the results from elementary 
diffraction experiments, and trying to understand them in terms 
what we’ve now learnt about Fourier transforms, we need to make a 
few qualifying remarks. 

The first point is essentially a technicality, but the above analy- 
sis assumes that we are considering Fraunhofer diffraction. This 
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CoO 
[oe =g 0 dz =e 
= 00 


cos20 = 2 cos? — 1 


is the limit where the projection screen is so far away that all the 
waves reaching a particular point can be considered to be travelling 
in parallel directions. The equations becomes more cumbersome 
when this approximation does not hold, and leads to the theory of 
Fresnel diffraction. 

The more serious point of note is that the observed, or measured, 
diffraction pattern is not the complex function 7(q) but its intensity, 
or modulus-squared, I(q): 


(a) = Wl = d(@) va. (2.58) 


The difficulties caused by such a loss of phase information, in terms 
of ascertaining the aperture function from its diffraction pattern, 
have been alluded to in Section 2.4.2, but we will encounter them 
again throughout this book. 


2.5.1 Young’s double slit 


A first introduction to interference experiments usually involves a 
Young’s double slit. This consists of a pair of very narrow slits that 
are separated by a distance d, and give rise to a diffraction pattern of 
uniformly spaced dark and light bands which become closer together 
as d increases. Let’s try to understand this theoretically by using 
eqns (2.57) and (2.58). 

The aperture function for a Young’s double slit can be modeled 
by two 6-functions located at a distance of d/2 on either side of an 
arbitrarily defined origin, 


A(z) = 6(a—§$) + 5(@+$), 


and is plotted in Fig. 2.14. Since 6-functions are easy to integrate, 
from eqn (2.54), the Fourier transform of A(x) is readily shown to 
yield 


Yla) = vo (ela? + e iad ) 
= We 2 cos(48) ; 


where we have used eqn (2.32) in writing the second line. The prod- 
uct of this diffracted wave with its complex conjugate, ~(q)", leads 
to the prediction 


I(q) « feos(4¢)]° x 1+ cos(qd) , (2.59) 


where all the multiplicative prefactors not involving q, such as [Yo 7 


have been omitted and a trigonometric double angle formula used 
on the far right-hand side. This pattern of ‘uniform cosine fringes’ is 
plotted in Fig. 2.14. 
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Fig. 2.14 The aperture function for a Young’s double slit, A(x), its Fourier transform, (q), and the diffraction pattern, I(q). 


The theoretical result in eqn (2.59) is consistent with the experi- 
mental observations: the dark and light bands are equally spaced, of 
uniform intensity and become closer together in inverse proportion 
to the distance d between the slits. The last feature is a universal 
property of Fourier transforms: the length scales which character- 
ize a function and its Fourier transform are inversely related to each 
other. This leads to the use of the terminology reciprocal space when 
referring to the Fourier domain. 


2.5.2 A single wide slit 


Another common interference experiment involves a single wide slit 
that gives rise to a diffraction pattern where the intensity of the 
light bands diminishes rapidly away from a central bright region, g p 
which is itself twice as broad as the rest. Let’s also try to understand 
this theoretically. 
If we take the z-origin to be in the middle of the slit of width w, 
then the aperture function becomes 


1 of |z|\<5 5 
A(z) = 


0 otherwise, 


and is plotted in Fig. 2.15. According to eqn (2.57), therefore, 


w/2 
pla) = Yo J elf? dg. 
—w/2 

This Fourier transform is easy to evaluate, because the integration d (oH) = per? 
of an exponential is straightforward, and yields da u 

iqxz J] ®/2 

yla) = Vo E | = us (claw? — eiaw/?) l 
1g —w/2 1q 


The difference of the imaginary exponentials on the far right-hand 
side can be recognized as being equal to 2i times sin(qw/2) from 
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Fig. 2.15 The aperture function for a single wide slit, A (x), its Fourier transform, 7(q), and the diffraction pattern, I(q). 


da —1las @—0 


sinc = 


[5(2-ma) el? dz = ead 


eqn (2.32). With this substitution, the modulus-squared ofy(q) leads 
to the prediction 


1 — cos(qw) 


a (2.60) 


2 

I(q) « È sin (4) x 
which is shown in Fig. 2.15 and consistent with the sinc-squared be- 
haviour of the observed diffraction pattern. We again see the inverse 
relationship between the width of the aperture function and the 
spread of the diffraction pattern: as one of them becomes broader 
the other gets narrower. 


2.5.3 A diffraction grating 


A diffraction grating is an aperture consisting of a large number of 
thin, parallel and equally spaced lines. In one dimension, it can be 
modelled as a periodic array of 6-functions: 


A(x) = ` d(a—md) , 


m =— Co 


where d is the distance between the grating lines. Swapping the or- 
der of integration and summation, and using eqn (2.54), the Fourier 
transform of eqn (2.57) reduces to 


W = % >> eam, 


m =— CoO 


The nature of ~(q) becomes apparent once we realize that it’s pro- 
portional to the sum of complex numbers that are of unit magnitude 
but varying phase. They will add up coherently if the product qd is 
an integer number of 27, yielding a huge resultant sum, but cancel 
out otherwise. Hence, 


yla) x >> d(q—na) , (2.61) 


n =—Co 
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Fig. 2.16 The aperture function for a diffraction grating, A(x), its Fourier transform, ~(q), and the diffraction pattern, I(q). 


where qo = 27/d. The diffraction pattern has the same structure 
as the grating, therefore, but the spacing of the lines is inversely 
related to d (Fig. 2.16). 

In terms of the physical set up of Fig. 2.13, where q =?2rsin0/À, 
sharp bright lines are seen when 


nà = dsinéd, (2.62) 


for n=0, +1, +2,..., +Nmax, Where the trigonometric constraint that 
Isin 0| < 1 imposes a cutoff on the highest observable order nmax. If 
the spacing of the diffraction grating is known, such as 500 lines per 
millimetre (so that d =2 um), then eqn (2.62) provides the basis for 
an accurate measurement of the wavelength of the illumination. If 
white light is used for the experiment, then the intense central line 
is accompanied by increasingly dispersed rainbows for the higher 
orders; this is because each of the wavelengths that makes up white 
light satisfies eqn (2.62) for a slightly different angle 0 for a given 
value of n40. 


2.5.4 The convolution theorem in action 


Although a real diffraction grating isn’t infinite as assumed above, 
we expect the analysis to be a very good approximation for one that 
is sufficiently large. The case of a grating of limited extent w can be 
addressed by combining the results of eqns (2.60) and (2.61) through 
the convolution theorem: as the aperture function can be expressed 
as a product of an infinite grating with line spacing d and a sin- 
gle slit of width w, as illustrated in Fig. 2.17, the Fourier trans- 
form of the finite grating is equal to the convolution of the Fourier 
transforms of the infinite grating and the single wide slit. The re- 
sultant diffraction pattern is simply that of the infinite grating but 
with each of the 6-functions replaced by a narrow sinc-squared func- 
tion, as shown in Fig. 2.17. A qualification is in order here, in that 
I(q) x |G(q)|? @ |H(q)|? is only an approximation (albeit a good one); 
strictly speaking, I(q) x |G(q) 9 H(q)|?. Given the inverse relation- 
ship between the length scales of a function and its Fourier trans- 


27 sin 0 _ 2mN 


A d 


A(x) = g(x) x h(x) 


~ pla) = Yo G(q) 8 H(q) 
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Fig. 2.17 The diffraction pattern from a grating of limited extent, w, can be evaluated from a knowledge of the Fourier trans- 
forms of an infinite grating, with line spacing d, and a single slit, of width w, through the use of the convolution theorem. 


A(z) = g(a) 8h(zx) 


"~ Yla) = Po G(q) x H(q) 


k = (ke, ky, kz) 


form, the width of the large diffraction peaks tells us about the size 
w of the grating whereas the distance between them indicates the 
d-spacing of its lines. As the number of grating lines goes up, so that 
the ratio w/d increases, the principal peaks become narrower and 
more low-level wiggles appear between them. 

The convolution theorem also enables us to ascertain the diffrac- 
tion pattern for a pair of broad slits from the results of eqns (2.59) 
and (2.60). Taking each to be of width w, and separated by d, the 
aperture function can be seen as a convolution of an ideal Young’s 
double slit with a narrow but finite single slit, as in Fig. 2.18. Since 
the Fourier transform of the former is then equal to the product of 
those of the latter, the intensity of the uniform cosine fringes that 
we'd expect from a perfect Young’s double slit is modulated by a 
slowly varying sinc-squared function. 


2.5.5 Multi-dimensional generalization 


Having illustrated Fourier transforms and the use of the convolu- 
tion theorem with one-dimensional versions of familiar high school 
experiments, let’s indicate the multi-dimensional generalization of 
eqn (2.57). A closer examination of Fig. 2.13 reveals q to be the z- 
component of the wavevector k of Section 2.1.1: 


k= (25) sinf = q and k; = (25) cos 6, 


2.5 Fourier optics and physical insight 45 


A(x) 
A 
x 
Kq) eo, 
OC 
>q >q 


h(x) 


>q 


Fig. 2.18 The diffraction pattern from a pair of slits of width w and separation d can be evaluated from a knowledge of the 
Fourier transforms of a Young’s double slit, of spacing d, and a single slit, of width w, with the convolution theorem. 


where we have taken z to be the original direction of propagation, 
from the aperture to the projection screen, and |k|=27/A. With this 
observation, it seems plausible that the two-dimensional diffraction 
pattern, I(k,,k,), from an aperture in the x-y plane, A(x, y), with y 
coming out of the page in Fig. 2.13, might be given by the modulus- 
squared of 


(ke, ky) = vo f IEG alkes +kuy) dr dy. (2.63) 
This double integral, over the surface area of the aperture, simpli- 


fies to the product of two one-dimensional integrals if the aperture 
function is separable: 


(bay hy) = Yo fAs(a)el dx f Anly) he" dy 


if A(x, y)= A(x) A.(y). The substitution of either 6(y) or a constant 
for A,(y), and the properties of 6-functions, allows us to confirm that 
eqn (2.63) reduces to the one-dimensional form of eqn (2.57) if the 
aperture is either infinitesimally thin or invariant with respect to y. 
Strictly speaking, 


woui(ke) as A>A,(x)d(y) and podi(kz) d(ky) as A>A (zx) 


[ima = 2r 0(q—qo) 


= oo) 
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From Sivia and Rawlings (1999), 
Foundations of Science Mathematics, 
Oxford Chemistry Primers Series, 77. 


————————————SSSS E) 
Multiple integrals 


In ordinary integration, we are concerned with the area under the curve 
y = f(x). Many functions of interest in real life entail several variables, 
and multiple integrals are a natural extension of the one-dimensional 
ideas to deal with multivariate problems. 

To get a feel for how multiple integrals arise, let’s consider a couple of 
physical examples. Suppose that we wish to calculate the force exerted 
on a wall by a gale. If the pressure P was constant across the whole face 
with area A, then the total force is simply P x A. With a varying pressure 
P(x,y), the answer is not so obvious. This situation can be handled by 
thinking about the wall as consisting of many small square segments, each 
with area ôx dy, so that the total force is the sum of all the contributions 
P(az, y) 6x dy; in the limiting case when 6x—0 and dy—0, we have 


Force = [fren dzdy 


wall 


where the double integral indicates that the infinitesimal summation is 
being carried out over a two-dimensional surface (in the x and y direc- 
tions). Incidentally, if the wall does not have a conventional (rectangular) 
shape then its area can be calculated similarly according to 


Area = J| dav: 


wall 


The double integral is also called a surface integral. 

Another illustration is provided by quantum mechanics where the 
modulus-squared of the wave function, |Y(x,y,z)|*, of an electron (say) 
gives the probability density of finding it at some point in space. The 
chances that the electron is in a small (cuboid) region of volume 6x dy 6z 
is then |1)(x,y, z)|? dz dy dz. Hence, the probability of finding it within a 
finite domain V is given by 


Probability = a lb(x,y,z)|? dx dydz, 
V 


which is known as a triple, or volume, integral. 


and demonstrates the reciprocal Fourier relationship between the 
widths of A, (y) and ~,(k,,) in the limit of complete invariance versus 
a 6-function. A careful consideration of the situation, in a manner 
analogous to that used to derive eqn (2.57), shows eqn (2.63) to be 
the correct two-dimensional extension. 

The most common case of two-dimensional diffraction is from a 
circular hole, but a rectangle is easier to deal with analytically. This 
is because the aperture function of the latter, which is equal to one 
inside the rectangle and zero outside it, is separable and yields a 
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Fig. 2.19 Diffraction patterns from two-dimensional apertures: (a) a rectangular 
opening of size Dz by Dy, and (b) a circular hole of radius R. 


Fourier transform that is just a product of the now familiar sinc 
functions in k, and ky; its modulus-squared is shown in Fig. 2.19(a). 
The evaluation of the integral of eqn (2.63) is less straightforward 
for a circular aperture, but the resultant diffraction pattern is plot- 
ted in Fig. 2.19(b). It is circularly symmetric, depending only on 
k?+kč, and is similar to a sinc function in the radial direction; the 
behaviour is formally governed by a J, Bessel function. The central 
bright region is called an Airy disc, and its spread is the basis of the 
resolution formula of eqn (1.8). 

Having seen the formulae for the Fourier transforms of one- and 
two-dimensional functions, in eqns (2.45), (2.46) and (2.63), we can 
state the /-dimensional generalization succinctly by using vector 
notation: 


CO fC CO 


f(r) = any f J zr fr% een (2.64) d?k = dks dky dk, 


and ker = krx + kyy +kzz +: 


(2.65) d?r = drdydz 


where r = (x,y,z,...) and k = (kz, ky, kz,...) have M components, 
with corresponding hyper-volume elements d“”r and d™k. We re- 
iterate that a Fourier transform and its inverse come as a linked 
pair, but which one is called which is arbitrary. Their precise defini- 
tions are also a matter of convention. No multiplicative prefactors 
are required if the wavevector is specified in cycles rather than ra- 
dians per unit length, for example, when the k in the exponents is 
replaced with 27k. 
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2.6 Fourier data analysis 


The analysis of data from X-ray and neutron scattering experiments 
is similar to the task of making inferences about the aperture func- 
tion from its diffraction pattern. If we knew that A(x) consisted of 
a small number of slits, n say, of equal spacing d, as in Fig. 2.17 
with w =(n—1)d, then an examination of the width and separation 
of the principal peaks in I(q) readily provides the desired parame- 
ters n and d. In less well informed circumstances, however, all we 
have to go on is the relationship between A(x) and I(q) enshrined in 
eqns (2.57) and (2.58). How can the data then be analysed and what 
difficulties are likely to arise? 


2.6.1 The phase problem 


Ignoring matters of practicality for the moment, the most relevant 
mathematical operation that can be performed on a diffraction pat- 
tern is a Fourier transform: 


j J- fiw g ka dk x j fof aar A(r+t) d™t, 


=p —coO = —Co —co —co 


giving the M-dimensional generalization of the ACF of eqns (2.55) 
and (2.56), where 


I(k) = |y(k)|? and (k) = vf ffa e 


= =Q — Co 


Whereas the correspondence between A (r) and the complex function 
p(k) is one-to-one, implying that there is no loss of information in 
the transformation, the same is not true of A(r) and the real and 
positive diffraction pattern I(k). Only the auto-correlation function 
of A (r) can be ascertained unambiguously from I(k), and we have 
already seen, in Figs. 2.11 and 2.12, how much more difficult it is to 
interpret the ACF than A(r). 

The simplest way of appreciating how the lack of phase, arg {4(k)}, 
in a diffraction pattern results in a loss of uniqueness about A(r) is 
to consider the Fourier transform of an aperture function that has 
been shifted by ro, 


Wo j / : J set eker dMr = w(k) eikere | 


= =g =00 


which differs from that of A (r) only through an additional factor of 
—k.r, in its argument; the intensity, or modulus-squared, 


[pao ir] [pio iee] = plo pt eiee ole, 


Fig. 2.20 The phase problem: (c) has the Fourier phases of (a) and the Fourier am- 
plitudes of (b), while (d) has the phases of (b) and the amplitudes of (a). 


is unchanged. The amplitude of a Fourier transform is, therefore, 
insensitive to translation. The same is true of the inversion of a real 
function, so that A(r) and A(—r) give identical diffraction patterns 
if A(r)=A(r)*. This provides another elementary demonstration of 
the loss of uniqueness without the Fourier phase. 

The importance of the phase of a Fourier transform can be illus- 
trated dramatically with graphical examples of the type shown in 
Fig. 2.20. Here two photographs, pertaining to any subject or scene, 
are Fourier transformed numerically and the phases of one assigned 
to the amplitudes of the other. Each of these hybrid sets of complex 
coefficients is then inverse Fourier transformed, and the resultant 
pictures examined visually. Instinctively we would guess that the 
outcome of this numerical experiment will be a complete mess, for 
why should the Fourier phases of one distribution of light intensity 
have anything to do with the amplitudes from another; if not, we 
might expect to see some sort of mixture of the two sources. What we 
find in practice is certainly degraded compared to the originals, but 
each output only resembles the scene which contributed the Fourier 
phase with no hint of that from which the amplitudes were taken. 
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A(z) = A(z)" = > I(q) =I(-@) 


I(q) 
N 


m q 
Umax Tmax 
ACF 
N 
D => xr 


It seems that most of the structural information in a Fourier trans- 
form resides in its phase; and since this is missing in diffraction 
data, it makes their analysis difficult in general without additional 
prior knowledge. 


2.6.2 Truncation effects and windowing 


Even when the main interest is in the ACF of the aperture function, 
and the absence of Fourier phase is not a problem, the limited sam- 
pling of a diffraction pattern causes difficulties in practice. In the 
simplest one-dimensional case, when I(q) is available only within 
the range |q| < qmax, the truncated Fourier integral 


dmax dmax 
Jo eiat dg = 2 f Ha) cos(qx) dq, (2.66) 
—dmax 0 


where the cosine equivalent on the right assumes that the aperture 
function is real, yields an estimate of the ACF that is corrupted by 
ripples with a characteristic wavelength of 27/qnax. These artefacts 
can be understood with the aid of the convolution theorem, and Fig. 
2.21, by considering eqn (2.66) to be the Fourier transform of the 
product of the full but unmeasured diffraction pattern, J(q), and a 
‘top-hat’ function of width 2 qmax, H(q). The result is, therefore, the 
true but unknown auto-correlation function, acf, convolved with a 


J(q) H(q) 
A 


acf 


Fig. 2.21 The Fourier transform of a diffraction pattern of limited q-extent, I(q), yields an ACF of the aperture function which 
is corrupted by truncation ripples associated with qmax; their origin is easily understood from the convolution theorem. 


sinc function, h(x), whose central peak has a full width at half max- 
imum (FWHM) of about 3.8/qmax- 

The messy picture due to the truncation ripples can be cleaned up 
greatly by multiplying the incomplete diffraction pattern, I(q), with 
a window function, W(q), which decays smoothly from one at the 
origin to zero around +qmax, before the (inverse) Fourier transform 
is calculated. This is illustrated in Fig. 2.22 with the ubiquitous 


Gaussian, 
2 


W(q) = exp (- 4) , (2.67) 
2 g? 

whose standard deviation o was chosen somewhat arbitrarily as 
qmax/2. The resultant auto-correlation function, denoted by acf, is 
said to be a filtered version of the ACF given by I(q). The suppression 
of the truncation ripples can also be understood from the convolution 
theorem, which tells us that acf(x) = ACF(x) ® w(x), because the 
subsidiary oscillations are averaged out through a blurring with the 
filter w(x). The latter is just the Fourier transform of the windowing 
function, W(q), with 


29 
w(x) « exp(—" 7 ) (2.68) 


for the case of eqn (2.67). Although the spurious peaks and troughs 
are increasingly reduced as w(x) becomes broader, requiring W(q) 


J(q) I(q) 
A A 
q q 
~ Umas Tmar E Q maz Q as 
acf ACF 
A A 
D > Tt 
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FWHM = v8ln2 o x 2.350 


Fig. 2.22 Truncation ripples can be suppressed by multiplying the diffraction pattern, I(q), by a ‘window’ function, W (q), 
which decays smoothly to zero over a g-range comparable to that of the measurements, before calculating the (inverse) Fourier 
transform; the resultant ‘filtered’ auto-correlation function can also be understood from the convolution theorem. 
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Data analysis: a Bayesian tutorial, 
Sivia (1996), Oxford University Press; 
2nd edition (2006) with Skilling. 


to be narrower, the drawback is that intrinsically sharp features 
of the auto-correlation function are smeared out even more. Thus 
filtering is a matter of striking a balance between the suppression of 
the truncation ripples and a further loss of resolution. A variety of 
windowing functions have been developed to try to best achieve this 
end. 

Diffraction measurements are often unattainable at low q-values 
as well as high ones, so that I(q) is available only within the range 
dmin < |q| S qmax. The truncated Fourier integral, 


dmax 
I(q) cos (qx) dq, 


dmin 


then yields an estimate of the ACF of the aperture function that is 
plagued by both low and high frequency artefacts. Structure in A(z) 
which is longer than around 27/qnin or narrower than about 27/qmax 
cannot be inferred reliably. The difficulty caused by a lack of I(0) is 
easiest to appreciate since, from eqns (2.57) and (2.58), it relates to 
the area under A(z): 


2 


I(0) « IEG da] œ Jaro) dz, (2.69) 


where the equivalent expression on the far right follows from the 
x-integral of eqn (2.55), or the inverse of eqn (2.56) with k (or q) set 
to zero. As the truncated Fourier integral implicitly assumes that 
I(0)=0 if qmin 4 0, the resultant ACF will contain equal amounts 
of positive and negative structure to ensure a net null area. Apart 
from at the origin, g=0, the diffraction pattern is insensitive to the 
addition of a constant to A(x) or its ACF. 


2.6.3 Noise and probability theory 


In practice, the analysis of a diffraction pattern is also limited by 
the noise in the measurement process and the extent to which the 
details of the experimental setup are understood and modelled. The 
task is not really one of calculating an inverse Fourier transform, 
which isn’t possible in a strict mathematical sense, but a matter of 
making inferences about the aperture function given incomplete and 
noisy data. The tool for dealing with and quantifying uncertainty is 
probability theory, as developed by Laplace (1812), and the reader is 
referred to Sivia (1996) for an extended tutorial. A brief overview is 
given below. 

The generic data analysis problem can be stated as follows: Given 
a set of N measurements {Dx}, for k =1,2,3,...,N, and some per- 
tinent information H, what can we infer about the object of inter- 
est A(x)? The Fourier nature of the experiment enters the analysis 


through the equation that predicts the k*t? data point, F;,, for a given 
aperture A(x): 


F, = f(a), k) , (2.70) 
where 
a 2 
Iq) = | po fao) ell? dg (2.71) 


and ‘f’ is the function that models the measurement process. In 
the simplest case Fk =I(q,), but a more common situation involves 
the convolution of I(q) with an instrumental response, or resolution, 
function, R(q), and the addition of a slowly varying background sig- 
nal, B(q): 


CO 


Fy = J (q) R(qe—@) ABU (2.72) 


=O 


The noise, or the expected mismatch between Fk and D;, is usually 
quantified through an error-bar, op, which is a shorthand way of 
assigning a Gaussian probability for the likelihood of the kt? datum: 


(Dy —Fr)* 
20,2 


prob( Dr |A() Ht) (2.73) 


1 
= —— exp 
Op V 27 


where the vertical bar ‘|’ means ‘given’ (so that all items to the right 
of this conditioning symbol are taken as being true) and the comma 
is read as the conjunction ‘and’. A knowledge of eqns (2.70)-(2.72) 
and, hopefully, the related resolution and background functions, as 
well as the error-bars, is implicitly assumed in H. If the N measure- 
ments, {D;,}, are independent, in that the noise associated with one 
is unrelated to that of another (as far as H is concerned), then their 
joint likelihood is just the product of the individual contributions: 


prob({Dx}] A(z), H) = il prob( Ds | A(x), H) . 
k=1 


In conjunction with eqn (2.73), therefore, the likelihood function for 
the data can be written as 


prob( {D} A(x), H) x exp(~ xX) , (2.74) 


where 
N 2 
E = (2.75) 
k=1 


is the sum of the squares of the normalized residuals. 


2.6 Fourier data analysis 53 


prob(D|F,c) 


54 Waves, complex numbers and Fourier transforms 


Our inference, or ‘state of knowledge’, about the aperture function 
in the light of the data and H is not encapsulated by the likelihood 
function but by the posterior probability, 


prob (A@) | {Dx}, H) s 


where the positions of {D;} and A(x) are reversed with respect to the 
conditioning symbol. The A(x) which gives the largest value for the 
posterior probability can be regarded as the ‘best’ estimate of the 
aperture function, while the range of the alternatives that yield a 
reasonable fraction of the maximum probability gives an indication 
of the uncertainty. The likelihood function is related to the posterior 
probability through Bayes’ theorem, 


prob ({Dx} | A(x), H) x prob (Al) |z) 
prob( {D} | H) 


2 


prob (A@) | {Dx}, H) = 


where the second term in the numerator is called the prior probabil- 
ity, and represents our state of knowledge (or ignorance) about the 
aperture function before the analysis of the data, and the denomi- 
nator usually constitutes an uninteresting proportionality constant 
(required for normalization) since it doesn’t explicitly mention A(z). 
The latter plays a crucial role when comparing different assump- 
tions or models, however, such as H; versus H>, and is referred to 
as the ‘global likelihood’, ‘prior predictive’ or simply the evidence in 
that context. 

A quantitative discussion of the aperture function is contingent 
on a parametric description of A(x), of course, and its choice is a 
reflection of the information H at hand. If it were known that we 
were dealing with a pair of slits of equal finite width, as in Fig. 2.18 
for example, then A(x) would be defined by the two parameters d 


and w as follows: 
ies 1 if |at¢|<¥, 


0 otherwise , 


where d>w> 0. If very little information was available, then we 
might use the formulation 


M 
A(x) = > cj G3 (x) , 


where the M coefficients, {c;}, define the aperture function through 
a linear combination of suitable basis functions, G;(x). Although a 
larger value of M provides greater flexibility in the range of A(x) 
that can be modelled, a more careful choice of the G,(x) can re- 
duce the number required and, thereby, aid many aspects of the data 
analysis task. 


(b) 


> Re 


Fig. 2.23 The likelihood constraint on the value of a Fourier coefficient given (a) a 
noisy measurement of its modulus-squared and (b) additional phase information. 


When the aperture function is defined by only a few parameters, 
the data tend to impose a strong constraint on their allowed values. 
The likelihood function dominates the posterior probability in this 
case and the prior, which is relatively broad to represent ignorance, 
is largely irrelevant. For the likelihood function of eqns (2.74) and 
(2.75), therefore, the optimal parameters of A(x) are those that yield 
the smallest value of y?; this is called the least-squares estimate. 
When little is known about the aperture function beforehand, and 
its description entails a large number of parameters to reflect this 
initial ignorance, it becomes important to give due consideration to 
the prior to encode whatever weak information is available about 
A(x). For example, positivity, bounds, local smoothness and so on. 
This leads to the use of regularization procedures, or constrained 
optimization, such as maximum entropy. 

The computational task of finding the maximum of the posterior 
probability distribution and determining its spread, in the space of 
the parameters used to describe A(x), can be a very challenging one. 
If we have a good initial estimate of the optimal solution, then an 
efficient gradient algorithm, such as Newton—Raphson, can often be 
employed. Otherwise we may need to use the slower, but more ro- 
bust, Monte Carlo methods. These sorts of practical considerations 
can make it tempting to ignore the noise and limited coverage of the 
data, and try to emulate an inverse Fourier transform in some way. 
For the ACF, and with appropriate filtering, this can provide a useful 
quick method for a qualitative analysis. 

As mentioned earlier, the loss of the Fourier phase in diffraction 
experiments causes a serious difficulty for ascertaining the aperture 
function. We can begin to appreciate this from a probabilistic point 
of view by considering the constraint that the likelihood function 
imposes on the value of a Fourier coefficient, 7(q,), when only its 
modulus-squared can be measured; this is shown pictorially in Fig. 
2.23(a). Unlike the case of Fig. 2.23(b), where additional phase in- 
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formation is available, the permissible values of Y%(qx) do not shrink 
towards a unique point in the Argand plane even in the limit of 
noiseless data; they reduce instead to a thin circular region, with a 
phase ambiguity of 27 radians. 


2.7 A list of useful formulae 


To finish off this principally mathematical chapter, covering the im- 
portant prerequisites for scattering theory, we give a list of some 
useful formulae. 
Powers and logarithms (see Section 1.1) 
aa” = atn CaN — qv 
a =l, a™=1/a and a= #/a (integer p). 


x 


psa => a= logy) 


log(AB) = log(A) + log(B) and log(A/B) = log(A) — log(B) , 


log(A’) = GB log(A) and log, (A) = log (A) x log,(a) . 
Trigonometry 
a s 0s 2 = l ar a =- "o : 
© r- cosec@ ’ Pr sech’ = æ cos6  cotô ` 


r? +y? =r? <> sin?’ð +cos?0 = 1 
tan? 6 +1 = sec? 0 
cot? 0 + 1 = cosec? 0 
sin(A+B) =sinAcosBtcosAsnB => sin26 = 2 sin cos0 


cos(A + B) = cosA cos B F sinA sinB => cos20 = cos? 0 — sin? 0 


2 sin A cos B = sin (A+B) + sin(A—B) 
2 cosA cos B = cos(A+B) + cos(A—B) 
—2 sinA sinB = cos(A+B) — cos(A—B) 


a b ae: 
snA sinB  sinC 


ec = a + b* —2abcosC 


Power series, sums and expansions (see Section 2.2.4) 


ee ui L? x 
sme =e 7 aT 
N oe 
coss = i= ta a 
, a gi 
e7 = exp(zr) = letat n 
Po E gi 5 
log,(1+x) = In(1+2) = p= Ta a a (|z|< 1) 
= —1)(p-2 
(14a)? = 1+pa+ PED gt p PPP ps4.. (In|<1) 
(a+b) = y "Cg e 
k =0 
n n 
` a+(k-1)d = 5 [20+ (n-1)d 
k=1 
n i n 
a = aU ”) as n—oo and |r|<1 
= 7 — 


Vectors (see Section 2.1.1) 


axi(bxe) = 


(aec)b — (aeb)c 


Complex numbers (see Section 2.2) 


sinha = £ > = —isin(iz) 
T ~E 

cosha = £ = = cos(iz) 

een sinh x e2*_] 

anhr = —— = =~ 
cosh x ert. 1 
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n! = nx(n—1)x(n—2)x---x3x2x1 


| 

ad 
N 
H 
(oe) 


2 
Vie sip aa 


2 8 
n n! 
ta = (2) = tet 
E~ k k! (n—k)! j 
N(N+1 
1$+2434--+N = wit) 
y 
\ A 4 y=sinhx 
\ i 
y=coshr\, yy 
NG oy, y=tanhz 
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Differentiation and integration (see Sections 2.1.2 and 2.3.1) 


y=. (28) My aay) p ow 
dz? da \dz dx” drede 7 7 dx dz/dy ` 
LT agp = Z o 
-~ "d aa 7 dz w) | v2 , 
n ii hie. ail d” “ dfu d®-Fy dy dy du dy/dt 
(uv) uv +2u’v+u-v a (uv) » Cr de® de® a7 u a 


a) Z | #0) L lta) < 
g” na” | Iing 1/x sinh x cosh x 
e7 e7 log , (x) (xlna)! cosh x sinh x 
a” alna | e* =2 ġe? tanh x sech? z 
y=sind <— > @=sin ‘y sin £ cosx | sino ba (1-2?) 1”? sinh! x (1422) 1”? 
cosy —sing | cos™! g (1 2)71/2 cosh a (a7 I 
tany sec?z | tanta (1+3 tanh! x (1-2?) : 
erf(—x) = — erf (x) feta = VT exf (2) and f ieat = 2r ô(xr— To) 
0 —oo 


(see Section 2.4.1) 


Fourier transforms (see Sections 2.4 and 2.5.5) 


CO CO 


F(k) = [ro etd, <— f(r) = + [PW elke dk 
f(x) F(k) | f(x) F(k) 
f{(r+2%o) elk Zo F(k) = ik F(k) 

dz 

f(x) ® g(x) G(k) F(k) | f(a) g(x) aq G(k) ® F(k) 
Ô(£r— zo) g tte. T 27 0(k) 
d(a+d)+6(x—d) 2cos(kd) | 6(a+d) —6(x—-d) 2isin(kd) 
l ee en 2 sin(kw) 2 d(a—nd) 2 6 (k — 25) 

e°? for £20 1 a alk 

0 otherwise atik | 70+?) D 

—o* k?/2 


2.7.1 Dimensional analysis 


Theoretical analysis involves the use of equations for understanding 
physical phenomena in a quantitative manner. Since the derivation 
of the relationships can be mathematically complicated, it’s always 
worth carrying out ‘sanity checks’ on the formulae before applying 
them in detailed calculations; this is a good way of detecting alge- 
braic mistakes and typographical errors. A requirement to simplify 
to familiar or intuitive results in elementary cases is one part of 
this approach, but the need for ‘dimensional consistency’ provides 
an even more basic test. 

Physical parameters related to mechanics can be analysed in terms 
of their associated dimensions of length, L, time, 7, and mass, M. 
Thus velocity, being a displacement per unit time, has dimensions 
of LT~'; acceleration, being the rate of change of velocity, LT ~?; 
force, from Newton’s second law of motion, M LT~?; energy, from 
work = force x distance, M L? T~?; and so on. While it may be nec- 
essary to add charge, Q, and temperature, O, to the basic list for 
dealing with electromagnetism and thermodynamics, the balance 
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MOE < œ 


f(z)=0 for-2=200 


a>O 
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implied by an = symbol means that the dimensions on both sides 
of an equation must match up (or else something has gone wrong). 
Indeed, the dimensions of every component separated by a + or — 
must be the same, and the arguments of functions, such as exp, log, 
sin and cos, should be dimensionless. 


Part II 


Elastic scattering 


The basics of X-ray and 
neutron scattering 


Having covered the preliminaries, we are now in a position to start 
discussing the basic concepts of X-ray and neutron scattering. For 
simplicity, we will initially restrict ourselves to the case where there 
is no exchange of energy in the process. 


3.1 An idealized scattering experiment 


The scattering of an X-ray photon, or a neutron, by a sample is char- 
acterized by the resultant change in its momentum, P, and energy, 
E. This is shown schematically in Fig. 3.1, where a particle incident 
with a wavevector k; and angular frequency w; emerges with a final 
wavevector ky and frequency wy. The fundamental relationships of 
quantum mechanics, eqns (1.12) and (1.13), enable the momentum 
transfer to be expressed as 


P = hk; -kp = AQ, (3.1) 
where the Planck constant ñ = h/27 and 
Q =k; —k;, (3.2) 
and, similarly, the energy transfer as 
E = ħw where wW = Wy — wf: (3.3) 


The momentum and energy gained by the scattered particle is equal 
to that lost by the sample, of course, and vice versa. The defini- 
tions of P and F as ‘incident minus final’, rather than the other way 


around, is a matter of convention. 
4 Wy 


=y Sample 


Fig. 3.1 A schematic representation of a particle being scattered by a sample. 


27 
k| = — 
Ik| = = 
w = ZTV 
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P = (Pr, Py, Pz) 


Q = (Qz,Qy,Qz) 


An ideal scattering experiment consists of a measurement of the 
proportion of incident particles that emerge with a given energy and 
momentum transfer. This is encoded in a four-dimensional function 
S(P, E), traditionally called the ‘scattering law’, where the vector P 
has three components; it is frequently written in terms of Q and w, 
as S(Q, E) and S(Q,w). To keep the discussion as simple as possi- 
ble in the early stages, we will begin by restricting ourselves to the 
special case of elastic scattering, 


E=0 <= w=0]|, (3.4) 


where there is no exchange of energy. This reduces the dimension- 
ality of the problem by one, because the scattering law is then only 
a function of momentum, or wavevector, transfer: 


SaO) = 8(Q,w=0). (3.5) 


The condition of eqn (3.4) means that the modulus of the wavevector, 
and hence the wavelength å, is unchanged upon scattering, 


(3.6) 


but the precise reason is slightly different for X-rays and neutrons. 
For a (massless) photon, the dispersion relation c= w/k, where c is 
the speed of light and ‘= |k|, implies that 


B= hu = hek] and Ep= hwp = helki, 


which leads to eqn (3.6) following the elasticity requirement that 
E= Eş. For a neutron, of mass ma, the energy transfer is given by 
the change in its kinetic energy 


2 2 
p liki? _ Laks! 
2a 2x 


which also yields eqn (3.6) for E=0. 


3.1.1 Elastic scattering and momentum transfer 


When talking about a scattering experiment, the picture that comes 
to mind is of particles in an incident beam being deflected in differ- 
ent directions. How does the geometry of the process relate to the 
wavevector transfer Q? 

The vector diagram for an elastic scattering event is shown in Fig. 
3.2, where an incoming particle is deflected through an angle of 26; 
the factor of two is conventional and algebraically convenient. From 
eqn (3.6), the triangle representing eqn (3.2) is isosceles with the 
two equal sides being of length 27/\. Elementary trigonometry then 
leads to the result 


3.1 An idealized scattering experiment 


Fig. 3.2 The vector diagram for elastic scattering, |k;| = [ky , through an angle of 20. 


o An sin 0 


Q= 5 , (3.7) 


where Q= |Q]. Although this links the magnitude of the momentum 
transfer to the wavelength and scattering angle, it doesn’t capture 
all the spatial aspects of the problem. 

To define the deflection of the scattered particle completely, a sec- 
ond angle, ¢, is required for the measurement of rotation about k;. 
This is illustrated in Fig. 3.3, where the z-axis has been chosen so 
that it is parallel to the incident beam: 


k; = (0,0, 34). 


The angles 26 and ¢ determine the direction of the outgoing X-ray, 
or neutron, in much the same way as latitude and longitude give 
the location of a point on the surface of the Earth (in this case, with 
respect to the ‘north’ pole and a reference meridian). More precisely, 
a careful examination of Fig. 3.3 allows kẹ to be expressed as 


k; = == (sin 20 cos @, sin 20 sin ¢ , cos 20), 


> Q 


Fig. 3.3 The scattering geometry in both Cartesian and spherical polar coordinates. 


2 
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sin 20 = 2 sin @ cos 0 


cos 20 = 1 — 2sin76 


Oe) 


sin 20 6g 620 


where |k;+| =27/A. Trigonometric double-angle formulae then allow 
the wavevector transfer, k;—k;, to be written as 


4r sin@ 


Q= 


(— cos ĝ cos, —cos@sing, sin 0) , (3.8) 


which satisfies eqn (3.7) as required. 


3.1.2 The differential cross-section 


The basic quantity that a scattering experiment aims to measure is 
the fraction of incident particles that emerge in various directions, 
as defined by the spherical polar coordinates 20 and ¢ in Fig. 3.3. 
If the neutrons or X-ray photons form a steady stream of incoming 
particles, then the incident flux, ®, is usually specified by their num- 
ber per unit time and per unit area perpendicular to the ‘flow (with 
SI units of m~*s~'). Their rate of arrival in the direction of 20 and 
Q, into a detector that subtends a small solid angle of AQ steradians 
(sr) at the sample, can similarly be stated as R per unit time (s~!). 
The ratio 


Number deflected by (20,6) per unit solid angle — R(20,¢) /AQ 
Number of incident particles per unit area of beam p 


is then closely related to the desired scattering function. It is known 
as the differential cross-section, da/dQ, and has SI units of m?sr~! 
or an area per unit solid angle. 

The precise definition of the differential cross-section used varies 
slightly depending on the field of application, but is often quoted per 
atom, or per molecule, through a division by the number of scatter- 
ing units of interest, N, in the sample: 


Solid angles 


Radians are a dimensionless measure of planar angles, giving the length 
of the circular arc that is created when a radius is rotated in one dimen- 
sion. Steradians are the corresponding unit of solid angle, Q, which give 
the area of the spherical shell fragment generated when a radius is wig- 
gled in two dimensions: 


__ Area of spherical shell fragment 
E Radius? l 


The solid angle representing all directions, therefore, is equal to the area 
of a sphere divided by the square of its radius, or 47 sr. An infinitesimally 
small area dA subtends an element of solid angle 


Q 


dQ = — (3.9) 


at a distance of r, which can be integrated up for finite regions. 
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do R(26, ¢) 
in NAQ (3.10) 
where, strictly speaking, the equality holds in the limit of AQ — 0. 
Two common alternative forms of da/dQ have a factor of V (m?) or 
M (kg) in the denominator of eqn (3.10), instead of N, so that the 
differential cross-section is normalized by the volume or the mass, 
respectively, of the sample. This leads to different dimensions and 
units for do/dQ, and care must be exercised in ascertaining, or stat- 
ing, which convention has been employed. 


3.1.3 Elastic versus total scattering 


The information contained in the differential cross-section can eas- 
ily be translated into an equivalent Sa(Q) of eqn (3.5) as long as 
all the scattering interactions can be assumed to be elastic and the 
incident beam contains particles of the same (known) wavelength å; 
it simply requires the transformation of the (26, ¢) directions into 
corresponding values of wavevector transfer with eqn (3.8). While 
a monochromatic beam can usually be arranged, the absence of any 
energy transfer is the fundamental assumption that is made when 
equating da/dQ with S,)(Q). 

In reality a sample will exhibit both elastic and inelastic (E £0) 
scattering interactions, and they cannot be distinguished without a 
measurement or a selection of the energy of the outgoing particle. 
Diffraction experiments generally don’t involve any energy discrim- 
ination, however, and rely on the property that the scattering tends 
to be predominantly elastic: 


do do Ra(26, Q) 
Tel pea ae R= Ra+t Pine 
dn (aa). NSAN Rn ne 


The small difference between the ‘total’ and the elastic differen- 
tial cross-sections, (da/dQ),...,, is the focus of inelastic studies. For 
most diffraction work it is just a nuisance that has to be minimized 
through experimental design, by using low temperatures for exam- 
ple, but is otherwise simply ignored. The exception is liquids and 
amorphous analysis, where an attempt is made to model and cor- 
rect for the discrepancy. 


3.2 Scattering by a single fixed atom 


The above discussion of the differential cross-section was concerned 
with the description of scattering data. Now let’s try to understand 
how the structure of the sample, at the atomic level, relates to the 
measured deflections. After all, this theoretical question is at the 
heart of this book. 
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The simplest place to start is a single ‘fixed’ atom. How would it 
scatter a beam of X-rays or neutrons? Well, a steady stream of par- 
ticles of wavelength A travelling in the z direction can be described 
mathematically by the complex plane wave 


bi = de®, (3.12) 


which is invariant with respect to x and y, where the incident flux 
=||”, k=27/A and i? =—1; the temporal variation of the wave 
and the phase offset have been subsumed into the argument of qo. 
By a fixed atom we mean that it cannot move or exchange energy in 
any way. The interaction is elastic, therefore, and the wavenumber 
of the outgoing particles is also k. 

After interacting with the atom, the particles move radially out- 
wards. If the origin is centred on the atom, so that the wavevector 
ky is parallel to the displacement vector r, then the sinusoidal part 
of the scattered wave is 

olkyer — elhr 
where r = |r|. Unlike the incident beam, which is collimated, the 
number of outgoing particles per unit area will fall with the distance 
r according to the inverse-square law. Hence, the scattered wave will 
take the general form 
ikr 


e 


We = Wo f(A, 0) (3.13) 


for r >0, where the function f(\, 0) tells us about the chances that a 
particle of a given wavelength is deflected in a certain direction (the 
symmetry of the problem indicating that it will not depend on the 
angle ¢); the Y, ensures that the rate of scattering is proportional 
to the incident flux. As the characteristics of f(A, 0) are different for 
X-rays and neutrons, this is a useful point to draw the distinctions 
between them. 


ikz 


Fig. 3.4 A steady stream of neutrons being scattered elastically and isotropically 
(with f = constant) by the nucleus of a single fixed atom at the origin. 
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3.2.1 Nuclear scattering lengths 


The neutron case is usually straightforward because f(A, 0) tends to 
be invariant with respect to wavelength and scattering angle: 


P= =, (3.14) 


where the constant b is called a scattering length and the minus sign 
is a matter of convention. The name is apt since f(\,@) must have 
the same dimensions as the r in the denominator of eqn (3.13) for 
wy to be consistent with ~;. The simplicity of eqn (3.14) is subject to 
certain provisos, the most important of which is that it applies to the 
scattering of neutrons by the nucleus of the atom; their magnetic 
interaction with the orbital electrons will be considered shortly in 
Section 3.2.3. 

The isotropic nature of the nuclear scattering stems from the very 
short range of the related strong interaction by comparison with the 
wavelength of thermal neutrons: around 10~'*m instead of 10710m. 
As far as the neutrons are concerned, the scattering is from a ‘point 
source’. The deflections occur equally in all directions, therefore, 
with the situation being analogous to the diffraction of water waves 
in a ‘ripple tank’ by a narrow aperture. The independence from A 
implied by eqn (3.14) holds for most nuclei, to a good approximation, 
in the neighbourhood of thermal wavelengths. The few instances 
where this is not so, such as for cadmium, gadolinium and boron, 
are associated with high likelihoods of absorption rather than scat- 
tering. All nuclei exhibit this phenomenon in the vicinity of certain 
wavelengths or energies, called resonances, but not many do so for 
the à of interest; given the rarity, a detailed discussion is delegated 
to Appendix B for simplicity. 

While scattering lengths are complex numbers in principle, their 
imaginary parts are generally so small that the b can be treated as 
real. Absorbing nuclei are the exceptions, but eqn (3.14) does not 
apply to them anyway because both the modulus and argument of 
f(\,0) varies strongly with wavelength close to a resonance. The 
magnitude of b determines the strength of the scattering, whereas 
its sign indicates whether the incident and outgoing waves are in or 
(180°) out of phase. 

The scattering interaction quantified by the constant b is not well 
understood, but it depends on the makeup of the nucleus and the 
orientation of its spin (if not zero) relative to that of the neutron. 
This means that the scattering lengths 


e are isotope specific, 
e have two different values for nuclei with a non-zero spin and 


e do not vary with atomic number in a simple or monotonic way. 


The nucleus of ordinary hydrogen, for example, which consists of 
a proton, has a spin of 1/2 like the neutron. Quantum mechanics 
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t Koester (1977) 


Ab = 4/((b-(b))’) 


shows that two such fermions can be combined to yield a resultant 
spin of 1 in three different ways and 0 in just one configuration. The 
scattering lengths for these triplet and singlet states, denoted by + 
and —, respectively, aret 


b+ = 1.085 x107'4m and = b> = —4.750x107'*m. 


On average, therefore, a hydrogen nucleus (or a proton) has a scat- 
tering length of 


W= at st 2 S054 x10 im (3.15) 
and a standard, or root mean square (rms), deviation of 
Ab = y (b2) — (b)? = 2.527 x1074m (3.16) 


where 
(b?) = 3 (b+)? + 1(b-)? = 6.524 x 10778 m?. 


Deuterium, which is the heavier isotope of hydrogen with unit spin, 
by contrast, has scattering lengths oft 


b+ = 0.953x107'4m and b7 = 0.098 x107!fm 


The arithmetic of quantum mechanical spin 


When a particle of spin Sı combines with another of spin S2, the resultant 
is specified by the operator sum 


S = Di I O2 


This deceptively simple expression, however, hides a surprising amount 
of complexity! For our purposes here, it will suffice to say that the total 
spin S can take integer-spaced values between 


Si+ S2 > S 2 |S1— S2] 
and that each case is associated with the 2+1 possibilities 
S, = S,S-1, S—-2,---,2-S,1-S,-S, 


where the z-axis is traditionally chosen to be the reference direction along 
which a discriminating magnetic field is applied. Thus, for example, a 
neutron (Sı = 1/2) can interact with a nucleus of unit spin (S2=1) to yield 
the quartet 


and the doublet 
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Fig. 3.5 The erratic variation of the neutron scattering lengths, b, with atomic mass, 
u, across the periodic table. The average over the nuclear—neutron spin combina- 
tions, (b), and its standard deviation, Ab, are indicated by the points and shaded re- 
gions, respectively; the most abundant isotopes are marked by filled circles, whereas 
the least common and the absorbing nuclei have been omitted. 


where the + and —, respectively, refer to the resultant quartet and 
doublet states with spins of 3/2 and 1/2. On average, therefore, 
deuterium has 


b) = 2 bt + b7 = 0.668x10° 4m (3.17) 
3 3 


and a standard deviation of 
Ab = È (b+)? + T — (b)? = 0.403x10-!@m. (3.18) 


These examples illustrate both the spin and isotope dependence of 
neutron scattering lengths; their unpredictable variation through 
the periodic table can be seen in Fig. 3.5. 


3.2.2 Atomic form factors 


The characteristics of f(A, 0) for the scattering of X-rays by an atom 
are very different to those considered for the neutron, because the 
relevant interaction is a long-range electromagnetic one with the 
orbital electrons rather than a short-range force exchanged with its 
nucleus. Assuming the common situation where the wavelengths do 
not correspond to energies at which significant absorption occurs, 
with the resonant case being addressed in Appendix B, f(\,@) is a 
real-valued function which 


e diminishes monotonically with increasing 0 and decreasing å, 
e has the same sign for all elements and 
e has a magnitude proportional to the atomic number Z. 


The last two properties are easy to understand since the strength 
of the scattering depends on the number of orbital electrons, which 
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Fig. 3.6 The X-ray form factor for carbon has a maximum proportional to its atomic 
number, 6, and decays to zero as the wavevector transfer, Q = 4r sin 0/A, increases; 
re is the classical radius of the electron. The equivalent variation of f(A, 0) for the 
scattering of neutrons by a 1°C nucleus is indicated with a dotted line. 


is equal to the number of protons in a neutral atom, and all have 
the same (negative) charge. The reason for the first feature will 
become clearer later but, for now, let’s go back to the analogy with 
the diffraction of plane water waves by an aperture in a ripple tank. 
As the gap in the barrier is widened, the emergent wave pattern 
changes from semi-circular to more planar with a little curvature at 
the edges; an increasing proportion of the wave is propagated in the 
forward direction, therefore, and less at higher angular deflections. 
This is pretty much what is happening, with the effect becoming 
more pronounced as the size of the atom becomes larger with respect 
to the wavelength of the X-rays. 

The exact nature of the decay of f(A, 0) with increasing scattering 
angle, 20, and decreasing wavelength, å, depends on the element, 
but it is a function of the ratio sin@/ and has a shape similar to 
that shown for carbon in Fig. 3.6. In contrast to the invariance of 
nuclear scattering lengths of eqn (3.14), this fall-off in f(A, 0) with 
the modulus of the elastic wavevector transfer, Q = 47sin@/X, is 
known as the atomic form factor: 


f(\,0) = Z g(Q) Tre (3.19) 


where Z is the atomic number, g(Q) decays from one at the origin 
to zero as Q — œ and re is the classical radius of the electron, or the 
Thomson scattering length, 


2 
re = ——— = 2.818 x107} m. 


— Ateg Mec? 
The parameters for an analytical approximation to Z g(Q), consist- 
ing of a constant plus the sum of four origin-centred Gaussians of 
different widths and amplitudes, can be found in Section 6.1.1 of 
Volume C of the International Tables for Crystallography (2006). 
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Before moving on, we should note that our elemental formulation 
of scattering in eqns (3.12) and (3.13) is inadequate in one impor- 
tant respect: it does not to take into account the transverse nature 
of electromagnetic radiation and its associated polarization. Like 
waves on a string, the sinusoidal fluctuations of the electric (and 
magnetic) field lie in a particular direction in the plane perpendic- 
ular to the propagation vector k. This interacts with the orbital 
electrons and gives rise to an emergent wave whose amplitude, ~,, 
is scaled by an electric dipole-dependence on the angles in Fig. 3.3, 
P(@,¢), as well as the form factor of eqn (3.19). Rather than modi- 
fying eqns (3.12) and (3.13), however, it’s simpler to compensate for 
the shortcoming by applying a multiplicative correction, P?(6, ¢), to 
the intensity, yl, at a later stage: 


P?(6,¢) = 1 — sin?20 cos? ¢, (3.20) 


where the incident electric field is taken as being aligned with the 
x-axis (6 = 0). For the three most common cases, this ‘polarization 
factor’ reduces to 


1 for O — +1/2, 
P?(@) = cos? 26 foro = 0 or 7, (3.21) 


+ (1+ cos?2@) for an unpolarized source. 


The formula for the source of unpolarized X-rays, where the incident i 27 
electric field is oriented randomly for different photons, is obtained (P*0)) = = i P?(6, 6) do 
by averaging P?(6, ¢) in eqn (3.20) over the angle ¢ (between 0 and 0 


27 radians). 


3.2.3 Magnetic form factors 


Neutrons also behave like little bar magnets because they possess a 
magnetic dipole moment, un, by virtue of their quantum mechanical 
spin: 

Un = —1.913 ux, (3.22) 


h 
where a 


P 


by = 5 = 5.051 xio" JT 

is called the nuclear magneton. This property is anomalous from a 
classical point of view, where magnetic dipoles are associated with 
small loops of current, as a neutron has no charge. With a spin of 
one half, it can adopt only two configurations relative to an external 
magnetic field: either parallel or antiparallel, known as ‘spin up’ and 
‘spin down’ respectively. The former state is favoured energetically 
at very low temperatures, but a random (unpolarized) mix is difficult 
to avoid under ambient conditions. 
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M = M; +M. 


= (IMI cos a) Q +M 


f/f, 
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Fig. 3.7 The magnetic form factor for neutrons (in blue) falls off more rapidly with 
increasing wavevector transfer, Q = 4r sin 0/X, than the X-ray case (dotted grey). The 
example shown is for the Mn?* ion, with fo = f(\,9=0). 


In addition to the nuclear scattering discussed in Section 3.2.1, a 
neutron can be deflected through a magnetic interaction between its 
dipole moment and that of the constituents of the sample. Since the 
principal source of the latter comes from unpaired electrons in the 
outer shells of atoms, the magnetic scattering of neutrons 


e is ion specific, 
e is subject to a form factor and 
e does not increase monotonically with atomic number. 


The magnetic form factor arises for the same reason as the X-ray 
case of the preceding subsection, but the fall off with Q is even more 
pronounced as only the outermost electron orbitals are involved; this 
can be seen in Fig. 3.7. The parameters for an analytical approxima- 
tion to the magnetic form factors can be found in Sections 4.4.5 and 
6.1.2 of Volume C of the International Tables for Crystallography 
(2006). 

When playing with bar magnets as children, we soon discover that 
the force between them depends on their relative orientations. It is 
repulsive if neighbouring ends are alike, for example, and attractive 
when they are of opposite polarity. The same is true at the micro- 
scopic level, and results in the magnetic scattering of a neutron by 
an ion being highly anisotropic. Whether or not this is reflected in 
the differential cross-section depends on the alignment of the mo- 
ments of the various atoms of the sample: do they form a correlated 
pattern or are they oriented randomly? We'll say more about that 
in Chapter 7, but note here the general property that the effect of a 
magnetic field is always at right angles to its direction. In terms of 
a neutron which has been deflected with a wavevector transfer Q, 
this means that it experienced the component of the magnetization 
M that was perpendicular to Q: 


Mi =M- (M.Q)Q, (3.23) 
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where Q is a unit vector in the direction of the wavevector trans- 
fer. The magnetic interaction can be ‘turned off’, therefore, if the 
moments of all the atoms can be aligned parallel to Q; this may be 
achieved through the application of an external magnetic field. An 
alternative method of separating the nuclear and magnetic contri- 
butions to the scattering is to make measurements above and below 
the critical temperature at which magnetic order is destroyed; this 
is called the Curie or Néel temperature, depending on the magnetic 
structure of the sample. 

Although most neutron experiments are done with an unpolarized 
beam, which contains equal numbers of spin up and spin down neu- 
trons, the selective use of a given orientation can be very beneficial 
in the study of certain types of magnetic samples. A full polariza- 
tion analysis is much more challenging, of course, and entails the 
additional determination of the spin direction of the outgoing neu- 
tron. X-rays also interact with the magnetic elements of a sample, 
but their use has been limited until recently owing to the weakness 
of the related scattering (~10* lower rate than that of Section 3.2.2). 
The availability of high intensity synchrotron sources through the 
1990s has seen a rapid expansion in the field of X-ray magnetic 
studies, which provides valuable extra insights; see, for example, 
Lovesey and Collins (1996). 


3.2.4 Scattering cross-sections 


Before moving on to consider scattering from a large assembly of 
atoms, let’s make the connection between the function f(A, 6) of eqn 
(3.13) and the corresponding cross-section o. The idea of the latter is 
quite simple: If the relevant scattering rate from an atom, summed 
over all directions, is R (s~' in SI units) when the incident flux is ® 
(stn ?), then this is equivalent to the hit-rate of a classical target 
with an area of o= R/ (m?) perpendicular to the incoming beam. 


Rate of scattering = Incident flux x Cross-sectional area. 


The number of particles of wavelength \ deflected in the direction of 
20 and ¢ in Fig. 3.3, per unit time and area, is given by the modulus- 
squared of eqn (3.13): 


® 
del = oe of = SIE, 


where ® = |Yo|". Hence the scattering rate, taken over all angles, is 
given by the double integral 


T 2T 
R= I |u|? dA, dA = r?sin 20 do d20 


20=0 ¢=0 
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1 barn = 10778 m? 


(barns) 


H 
D 


Ocoh 


1.76 
5.59 


O incoh 


80.27 
2.05 


with dA = r? sin 20 dọ d20 being the element of area generated when 
a radius r pointing in the direction of (20,¢) is moved through in- 
finitesimally small angular displacements of d26 and dọ. It leads to 
the relationship 


o (A) = 2r | IE(A,0)|? sin 28 420, (3.24) 


20=0 


where the prefactor of 27 comes from the evaluation of the inte- 
gral with respect to ¢. For the neutron case of nuclear scattering in 
eqn (3.14), when f(\,0) is independent of à and 0, the 20-integral 
readily yields the link between the scattering length b and the cor- 


responding cross-section: 
o = 4r |b}? | (3.25) 


A barn is a common unit for such areas, and is 107° times smaller 
than a square metre. 

In Section 3.2.1, we noted that the scattering lengths of isotopes 
whose nuclei have non-zero spin are specified by an average value 
and a standard deviation, 


b = (b) + Ab, 


which is also appropriate for a sample containing a natural mixture 
of isotopes for a given atom. A rearrangement of this equation, in- 
volving squaring and the taking of means, returns the variance of 
eqn (3.16) in the form 


(b?) = (b) + (Ab)’, 


and enables the average scattering cross-section to be written as the 
sum of two quantities: 


(Oscat) = 4r (b?) = Ocoh + Cineoh 5 (3.26) 
where the coherent and incoherent cross-sections are defined by 
Tech = Ar (b)? and: = Wines = 4r (Ab). (3.27) 


We'll discuss the physical significance of ccon and Gincon in the next 
section but, as their names suggest, the former gives rise to interest- 
ing structure in the scattering data whereas as the latter generally 
adds a featureless background signal. The values of (b) and Ab cal- 
culated in eqns (3.15)—(3.18) show that hydrogen, unlike deuterium, 
suffers from a very large incoherent cross-section. 

In the preceding paragraph, we have implicitly assumed that the 
scattering lengths are real. While this is usually the case, the imagi- 
nary part is not negligible close to an absorption resonance. Whether 
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it be for neutrons or X-rays, the rate of absorption processes can also 
be characterized by a cross-section, cabs, Which varies strongly with 
à in the neighbourhood of a resonance. It is related to the decrease 
in the intensity, J, of a beam as it passes through a distance l of the 
corresponding material by 


I = I 677s! (3.28) 


where J, is the incident intensity and n is the number of atoms per I EE er N 
unit volume. The product n cabs is called the absorption coefficient n = veil a E 
and its reciprocal is a mean free path, or the average distance that a 

particle travels before being absorbed. 


Atomic mass m 


3.3 Scattering from an assembly of atoms 


Having considered the scattering of X-rays and neutrons by an iso- 
lated fixed atom, lets see what happens when a large assembly of 
them is encountered; the latter is, of course, a model for the sample. 
If the incident beam is represented by a complex plane wave with 
wavevector k; = (0,0, k), as in eqn (3.12), a particular atom, labelled 
by the index j, will make a small contribution to the scattered wave, 
[vy], , of the form in eqn (3.13): 


elkse(r—R;) 


[Sep]; = wo elk £5 (A, 0) (3.29) 


|r—R,| 
where f,(A, 6) defines the relevant interaction characteristics of the 
j*" atom, located at R;, ky is the wavevector of the outgoing parti- 
cle and r is an arbitrary position; the geometry is illustrated in Fig. 
3.8. Although eqn (3.29) is more complicated than eqn (3.13), be- 
cause the atom is no longer at the origin, it corresponds to the same 
physical picture and simplifies to the earlier case when R;=0. The 


Fig. 3.8 The contribution to the scattered wave from atom j, located at R; relative 
to an arbitrarily defined origin somewhere in the sample. 
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advantage of this generalization is that it enables us to add up the 


ôs contributions from all N atoms in the sample and work out the 
net scattered wave wr: 


ie N ciQeR,; 
=e" f; Lo 
wp = poe 2 yA, 4) =r (3.30) 


where everything not involving j has been taken outside of the sum- 

mation and Q=k;—k; is the wavevector transfer of eqn (3.2). While 

this is a straightforward application of the superposition principle, 
there is an implicit assumption that the scattering process is weak. 
That is to say, the scattered wavelets have a negligible effect on the 
incident beam. This simplification is often referred to as the Born, or 
the kinematical, approximation and holds for both X-rays and neu- 
trons in most instances. Events where particles are deflected more 

than once have also been ignored, and this too is conditional on very 
low interaction rates. Depending on the size of the sample, and the 
geometry of the experimental setup, corrections for multiple scat- 
tering and absorption may need to be made, but we will not go into 
such issues. 

The distance to the detectors, where the measurements are taken, 


is much larger than the typical size of a sample. This means that, to 
a very good approximation, 


|r—R,;| = |r| =r, (3.31) 
with the situation being illustrated in Fig. 3.9. In this Fraunhofer 
or far-field limit, the denominator in eqn (3.30) can be replaced with 


r and taken outside the summation. The scattered wave then has a 
modulus-squared of 


Incident beam 
zs] 
a. 
| ; 
4 
ap ¥ 
08 jo 


Sample 


Fig. 3.9 The size of the sample illuminated by the incident beam is usually much 


smaller than the distance at which scattering measurements are taken, so that 
|R;j| << |r| ~ |r—R,| for all j. 
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where ® = |2),|” is the incident flux. Particles of wavelength À are 
deflected elastically into a detector with a small area 6A, in the di- 
rection of 20 and ¢ (with respect to Fig. 3.3), at a rate Rea per unit 
time given by 

2 


N 
Rei(20,¢) = |b? 5A = BIN] S— GA, 8) Eee 
j=l 


where 69 = 6A/r? is the solid angle subtended by the detector at 
the sample. Hence, the differential cross-section of eqn (3.11) can 
be related to the structure of the sample, with an atom of type f; 
located at R}, for 7 =1,2,3,..., N, through 


Ra(20, $) 
, (3.32) ci CoO 


where the proportionality stems from the fact that the right-hand 
side has not been divided by N, or the mass or volume, or whatever, 
of the sample. As noted in Section 3.1.2, the choice of normalization 
varies across disciplines. 


3.3.1 Scattering density and Fourier transforms 


Much of Section 3.2 was devoted to a discussion of f(A, 6), the func- 
tion used to describe the nature of the scattering of neutrons and 
X-rays as a result of their various interactions with an atom. The 
simplest case was that of eqn (3.14), which was attributed to the nu- 
cleus being point-like. By the same token, the decay of f(A, 0) with 
increasing sin 0/A, or Q, in Sections 3.2.2 and 3.2.3 was put down to 
the orbital electrons occupying a space whose dimension was compa- 
rable to the wavelength of the probing X-rays and neutrons. While 
arguments based on an analogy with the behaviour of water waves 
in a ripple tank were offered, a deeper understanding was promised 
later; let us now deliver. 

The summary above suggests that the characteristics of f(A, 0) are 
determined by the size and shape of the object responsible for the 
scattering. The latter can be quantified by a scattering length den- 
sity (SLD) function, G(x, y, z), with SI units of n°, of the appropriate 
scattering material; nuclear, electron or magnetization, depending 
on the type of probe and interaction. An infinitesimally small vol- 
ume dV, at a given location R, will then encapsulate a point source 
of isotropic scattering with 


f(A,0) = B(R) dV. 


The substitution of this elemental form of f(\,@) in the preceding 
analysis, and a three-dimensional summation over the x, y and z 
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dV = dR = drdydz 


Fig. 3.10 A two-dimensional SLD func- 
tion, (x,y), consisting of the sum of 
three discrete components. 


arguments of 3(R), leads to the following continuum generalization 
of eqn (3.32): 


where V is the total volume within which scattering may occur. The 
elastic differential cross-section is related to the structure of the 
sample, therefore, through the Fourier transform of its SLD func- 
tion, G(R). 

The connection between the SLD distribution of a given atom, j, 
centred on the origin, 3;(R), and the corresponding f;(A, 0) becomes 
apparent when (3(R) is expressed as the sum of N discrete contribu- 
tions: 


N 
B(R) = X` 6 (R-R,), (3.34) 


j=1 


where R; is the location of the jth atom. The substitution of eqn 
(3.34) into eqn (3.33), and the use of the ‘shifting theorem’ of Fourier 
transforms, 


/ 6;(R—-R,) iWR BR — cians fff BiR) ci QeR GR, 


yields eqn (3.32) with the identification that 


£;(\,0) = f f J a(Ryet?-F GR |. (3.35) 


Since the result of the integral on the right-hand side is a function of 
the wavevector transfer Q, which depends on å, 6 and ¢ in general, 
the absence of ¢ on the left-hand side means that we have assumed 
an intrinsic symmetry for the SLD distribution of the atoms. Indeed, 
the properties of Fourier integrals ensure that the simple spherical 
choice of 6;(R) = 3;(R), where R = |R|, leads to f;(Q) = f;(Q) in 
accordance with the observations noted in Sections 3.2.2 and 3.2.3 
that the form factors vary with the ratio of sin@/A. The reciprocal 
relationship between the width of a function and that of its Fourier 
transform — if one is broad the other is narrow — further explains 
the relative decays of f(A, 0) with Q for the various types of scat- 
tering interactions discussed in Section 3.2: G(R) for a nucleus is 
essentially a 6-function, and so gives rise to scattering that is uni- 
form with respect to Q; the orbital electrons are spread out, and 
yield a Fourier transform of finite extent. 


3.3 Scattering from an assembly of atoms 81 


3.3.2 Temperature and Debye-Waller factors 


So far, we have taken the atoms to be fixed in space. Although this 
is a reasonable assumption at very low temperatures, with T ~0K, 
they oscillate about their equilibrium positions (in a solid) with in- 
creasing vigour as the thermal energy rises. The blurred nature of 
the resultant G(R) can be modelled crudely by convolving the zero- 
temperature SLD function with an isotropic Gaussian whose width 
is proportional to the square root of the temperature: 


R? 


a(R) = | 60) | = 


T= 
where the constant a, with SI units of m? K-t, defines the variance 
of the atomic deviations through 


3aT 
(R°) = 2 , 


and y = (arT) ensures that the volume integral of (R), or 
the total amount of scattering material in the sample, does not vary 
with temperature. In conjunction with eqn (3.33), the convolution 
theorem of Section 2.4.1 then gives 


~3/2 


Sei(Q) = [S.1(Q) | 


2 
x ep(-272 ) , (3.37) 


and tells us that larger angular deflections are suppressed ever more 
as the temperature rises. The multiplicative term on the far right, 
which encodes the exponential fall-off in the scattered intensity with 
Q?, is called a Debye-Waller factor. 

The assumption that all the constituents of a sample vibrate with 
the same average amplitude, and equally in every direction, was im- 
plicit in eqn (3.36). When this is not adequate, because lighter atoms 
are easier to displace than heavier ones, or the bonding breaks the 
spatial symmetry, we can consider the effect of temperature on the 
f(A, 0) of individual atoms before using eqn (3.32) to obtain S,)(Q). 
A convolution based argument similar to the above leads to 

a; TQ? 
£00) = [ao], x eE), 


(3.38) P20 +0 FO 


for j =1,2,3,..., N. Directionality can be built into the formalism 
by generalizing the exponents of the Debye—Waller factors to rep- 
resent ellipsoids in Qz, Qy and Q- instead of spheres, but this re- 
quires the introduction of five extra parameters (like a,;) per atom. 
At high temperatures, the amplitude of the fluctuations can be so 
large that anharmonicities in the restoring force become apparent 
and a quadratic dependence on Q is no longer sufficient. Whatever 
the case, it’s important to note that the decay in the scattered inten- 
sity with increasing sin@/\ for T`>0 arises for the same reason as 
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B(x) 


the X-ray atomic and neutron magnetic form factors of Section 3.2: 
the reciprocal relationship between the width of a function, (,(R), 
and that of its Fourier transform, f; (Q). 


3.3.3 Coherent and incoherent scattering 


In Chapter 2, we saw that Fourier transforms were linear combi- 
nations of sinusoidal functions. Whether such a summation gives 
rise to something interesting depends on the relative weighting with 
which the different waves are added. In terms of eqn (3.33), Sa (Q) 
only has notable structure if G(R) has a certain degree of regularity. 
For the nuclear scattering of neutrons, where the scattering lengths 
vary with the isotopic species and its nuclear spin state, it is ana- 
lytically helpful to decompose the SLD function into an average part 
which shows a discernible pattern, (3(R)), and a set of random de- 
viations, AG(R): 


B(R) = (B(R)) + AB(R) . 


A one-dimensional example of this procedure is shown in Fig. 3.11. 
At T +0K, the well-defined component of the SLD function can be 
approximated by 


N 
= >> (B;) (R-Ry) , (3.39) 
j=l 


where ((3;) is the spin and, if appropriate, isotope abundance av- 
eraged nuclear scattering length of the jth atom, as described in 
Section 3.2.1, which is located at R;. The A(R) contribution is 
different for each neutron, because it depends on the relative orien- 
tation of the nuclear and neutron spins, and so 


(AG(R)) = 0 
with the rms fluctuation at R; being Ab,. 
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Fig. 3.11 A very simple one-dimensional example of how a function, 6(x), can be decomposed into the sum of an average pattern 


showing discernible regularity, (B(x 


)), and a set of randomly located Pactudtionn: Ata), 
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The Fourier transforms of the one-dimensional example of (/3(R)) 
and A(R) in Fig. 3.11, (F(Q)) and AF(Q) respectively, are shown 
in Fig. 3.12. As anticipated, the former displays notable structure 
whereas the latter consists of ‘white noise’. The resultant (spin av- 
eraged) scattering function of eqn (3.33), 


2 
su(Q) « (| (F(Q)) +4F(Q)["). 
can then be expressed as the sum of two terms, 


Se(Q) = [Sa(Q)] a + [Se(Q)| (3.40) 


coh incoh ’ 


because the cross-term averages to zero: 


(Re {(F(Q))” AF(Q)}) = 0. 


Explicitly, the structured part of Sa (Q), or the coherent scattering, 
arises from the SLD function of eqn (3.39), 


[Se(Q)] on <J i [ioe el@°F g3R | , (3.41) 


whereas the incoherent contribution simply adds a constant back- 
ground signal with a magnitude equal to the average value of JAF|’; 
the latter is, in turn, related to the mean incoherent cross-section of 
the sample (through Parseval’s theorem): 


[Se(Q)] incon < ((A8)) . (3.42) 


Unless otherwise stated, it will be assumed that only the coherent 
part of the elastic scattering is of interest . 


(F(q)) AF(q) 
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Fig. 3.12 The Fourier transforms of (B(x )) and A@(z) in Fig. 3.11, (F(q )) and AF (q 


[Zi Zl? = (414+ Z2) (Z1+Z2)" 
= |Z|" + |Z|" 
42 Re {Zt Zo} 


Sei(@) 


) respectively, lead to a resultant scattering 


function, S,.)(q), which is proportional to the sum of (F(q Di and a small constant of magnitude (|AF(q )| Pis 
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3.3.4 Mixed scattering events 


Before moving on to consider sources of X-ray and neutron radiation, 
we should note that the scattering function from a mixture of inter- 
action events tends to decompose in a manner similar to eqn (3.40). 
With unpolarized neutrons, for example, 


Sei(Q) = [Se1(Q)] inc T [Se(Q)] rag ? (3.43) 


where the terms on the right are the independent contributions from 
nuclear and magnetic scattering. 


3.4 X-rays and synchrotron sources 


In principle, the simplest way of producing X-rays is through the 
generation of very high temperatures. All objects give out electro- 
magnetic waves, called black body radiation, but the bulk of these 
emissions occur at a wavelength, max, that depends inversely on its 


temperature 7’: 


2.90 x 1073 
Amax = m, 
T 


where T is in kelvin. This is known as Wein’s Law, and it explains 
why people (and other warm blooded animals) can be detected with 
infrared cameras in the dark and why the glow of metals changes 
from red to white as they are heated. Photons with Amax ~ 1 Å would 
become dominant, therefore, if the temperature could be raised to 
around 30 million degrees. Since this is comparable to the inside of 
the sun, and well above the melting point of any material, this is not 
a very practical avenue. 

Rather than trying to convert thermal energy into X-rays, another 
tack is to make use of kinetic energy. If an electron, having a charge 
of -e, is accelerated towards a metal target by the application of a 
positive voltage V, then eqn (1.12) and the conservation of energy 
require that the frequency v of any emitted photon satisfies 


hv < eV, 


with a shortfall on the left being due to heat dissipation. Equation 
(1.16) allows this to be written as 


he _ 1.24x107° 
A > vy ~ y m, (3.44) 
where V is in volts, and shows that X-rays with atomic wavelengths 
are possible with a modest 10 kV. Indeed, in conjunction with vac- 
uum tubes, this approach has formed the basis of X-ray production 
in laboratories for over a century. 

The actual conversion of the electron’s kinetic energy into a resul- 


tant photon on striking the metal target arises from two different 
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mechanisms. The first is direct, and occurs when an electron (or any 
charged particle) decelerates rapidly. This is called Bremsstrahlung, 
or ‘braking radiation’, and yields a continuum of wavelengths with a 
lower bound given by the equality in eqn (3.44). The second process 
is indirect, and is initiated by the incoming particle knocking out an 
electron from one of the inner shells of an atom in the target. This is 
followed by the relaxation of an electron from a higher energy level, A 
E;, to the more stable vacant position, E;, and is accompanied by N E, 
the emission of a photon with a frequency, vij, corresponding to the : 
change in energy: 


L —e E, 
h vij = Ej= Ez: 
The quantized nature of the atomic orbitals means that this mecha- 
nism produces photons at a discrete set of wavelengths, 
he M2 | M3 
Aij = ; (3.45) ~-,* : Vr 
oe, Ka A 
subject to the constraint of eqn (8.44). The spectrum of X-rays that 
emerge is illustrated schematically in Fig. 3.13. By far the strongest : 
feature is the K, line, at 12, which results from an electronic transi- : 
K ——o E, 


tion from the second lowest energy level to the ground state. Hence, y ' y 
this is the primary wavelength at which everyday X-ray work is done 
in laboratories. 

Although vacuum tube technology has been the mainstay of X-ray 
studies for a century, certain intrinsic features have kept improve- 
ments modest. For example, since most of the input kinetic energy 
is converted into heat rather than photons, the rate at which the 
target can be cooled restricts the output intensity of the source. The 
divergence of the X-ray beam and its fixed wavelength (for a given 
metal target) also impose experimental limitations. A huge step for- 
ward has come with the development of dedicated X-ray synchrotron 
facilities. 


log(I) 


Bremsstrahlung 


hc/eV 


Fig. 3.13 A schematic illustration of the the logarithm of the intensity versus wave- 
length for an X-ray tube. The sharp lines in the K-series involve electronic transitions 
down to the lowest energy level, with a principal quantum number of n = 1; the sub- 
scripts a, 8 and so on denote changes of An =1,2, ..., respectively. 
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Fig. 3.14 Synchrotron emission from 
Cassiopeia A, the supernova remnant 
of a massive star that exploded about 
300 years ago. (Courtesy of Prof. S. F. 
Gull, Mullard Radio Astronomy Obser- 
vatory, Cambridge.) 


When a charged particle is accelerated, it emits electromagnetic 
waves. Bremsstrahlung is an example that involves deceleration. 
Since acceleration can refer to a change in the direction of motion 
as much as speed, electrons moving in circular (or helical) orbits in 
response to a magnetic field radiate; at relativistic speeds, the emis- 
sions are in the X-ray waveband. This is the basis of synchrotron ra- 
diation. It occurs naturally in space, being responsible for the radio 
emission from the Galaxy, supernova remnants and extragalactic 
sources, but was first noticed in circular particle accelerators built 
for high energy physics in the late 1940s; this is the origin of the 
pronoun ‘synchrotron’ for such radiation. 

The early work on synchrotron X-rays was done parasitically at 
facilities designed to study subatomic structure. In this context, the 
radiation is an undesirable byproduct whereby some of the electrical 
energy used to speed up the electrons and positrons (or protons and 
anti-protons) is lost in the process of confining their trajectories to 
a closed orbit with a magnetic field. As this leakage is minimized 
by reducing the curvature of the path, storage rings for high energy 
physics experiments have become ever larger; the LHC, in Geneva, 
has a circumference of 27km! Conversely, a higher radius of curva- 
ture is advantageous if the principal objective is the production of 
X-ray synchrotron radiation. Modern sources typically have a diam- 
eter of a few hundred metres, with the size being dictated by the 
number of beamlines planned for the facility. 

The great success of synchrotron X-rays is primarily down to their 
highly collimated nature, which allows the rate at which photons 
impinge on a sample to be several orders of magnitude higher than 
that from vacuum tubes. This enormous enhancement is due largely 
to relativistic beaming, whereby photons are radiated preferentially 
in the (instantaneous) forward direction as the electrons circulate 
in the storage ring at very close to the speed of light c. A classi- 
cal analogy of this phenomenon is given by a sprinkler that squirts 
water uniformly as illustrated in Fig. 3.15(a). From the perspec- 
tive of someone moving towards the fountain, however, the outflow 
is elongated in the direction of the relative motion; this is shown in 
Fig. 3.15(b). Although this familiar Galilean transformation breaks 
down at relativistic speeds, when v—c and the Lorentz factor 


(a) (b) 


Fig. 3.15 A simple classical analogue of relativistic beaming: (a) the uniform out- 
flow of water with respect to the fountain; (b) the corresponding velocity distribution 
relative to an approaching observer, distorted in the direction of the relative motion. 
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(a) 


(b) 


Fig. 3.16 Polar diagrams of the intensity, r, versus angle, 0, of the radiation emitted 
by a charge accelerated tangentially to its direction of motion from the perspective of 
(a) the rest frame of the charge and (b) an observer in the laboratory. 


y =1/2 
y = ( — = > ds (3.46) 


a Lorentz transformation of the dipole radiation emitted by an accel- 

erated charge in its own rest frame explains why an observer in the _ 

laboratory sees a narrow beam of photons aligned with the forward = > 
direction of the particle. The case of y=3, corresponding to 94.3% of 
the speed of light, is illustrated in Fig. 3.16. The bulk of the emis- 
sion is confined to an angular divergence of 1/y radians, typically 
less than one minute of arc with y ~ 104, which sweeps tangentially 
around the circumference of the storage ring like a highly focused 
search light. 

The main characteristics of the radiation emerging from a circular 
synchrotron are determined by the kinetic energy of the electrons, 
E, and radius of the orbit, R. The Lorentz factor, for example, which 
controls the collimation of the beam, is given by 


y=lit onin, (3.47) 
MeC 
where the numerical approximation on the far right assumes that 
E `> mec? and is given in the common, but non SI, units of GeV. A 
heavier charged particle, such as a proton, would require the input 
of much more energy (about a thousand fold) to achieve the same 
value of y as an electron. An observer in the plane of the orbit sees 
brief flashes of duration 


AT x = «(L)rx 7 


~ 272 2T Dons? 


— 


where the period T ~ 2rR/c and the 27° term is the Doppler fac- 
tor for a source of radiation approaching at relativistic speed. This 
gives rise to a continuous spectrum of photons with a lower bound 
of Amin © CAT and a ‘critical’, or median, wavelength of 


_4nR  0.56R 5 
c — 343 os E3 ’ 


> 


(3.48) 


where the numerical approximation assumes that the radius is given | 
in metres and the energy in GeV. O Ac 
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Fig. 3.17 Average brightness values. 


The final notable feature of synchrotron radiation is its intrinsic 
polarization. Indeed, the detection of a high degree of polarization 
from an astronomical source is a good indication of an underlying 
synchrotron process. In terms of a storage ring, the electric field of 
the emitted photons lies in the orbital plane. 


3.4.1 Wigglers and undulators 


The picture of a perfectly circular synchrotron considered above is 
an idealized one. In practice, a storage ring consists of a combina- 
tion of curved regions, where a uniform magnetic field is applied 
perpendicular to the orbital plane, and straight sections, containing 
magnetic quadrupoles that focus the electron beam and RF (radio 
frequency) cavities which pulse a voltage along their path. The lat- 
ter give an electrostatic speed boost to the electrons, to compensate 
for their gradual slowing down as a result of the energy lost in the 
radiative process. The bending magnets are solely responsible for 
the synchrotron emission in this setup, and our discussion has so 
far been based on them. 

In modern X-ray facilities, known as third generation sources, to 
distinguish them from the dedicated but conventional second gener- 
ation ones, even greater intensity and versatility is achieved through 
the use of insertion devices called wigglers and undulators. These 
consist of banks of magnets, of alternating vertical polarity, that are 
placed in the straight sections of the storage ring to force the elec- 
trons to execute horizontal oscillations. The curvature of the zig-zag 
path induces the synchrotron beam, but its overall fixed orientation 
allows the emissions to build up in that direction rather than just 
flash by as with a bending magnet. The perturbation of the linear 
trajectory is much smaller for undulators than wigglers, and this re- 
sults in a more focused beam with the power concentrated around 
certain well-defined, and tunable, wavelengths; details can be found 
in Als-Nielsen and McMorrow (2001). For these reasons, undulators 
have become the principal source of synchrotron radiation at X-ray 
facilities. 

The huge improvement in the power of X-ray sources can be seen 
from Fig. 3.17, where the average brightness from vacuum tubes, 
bending magnets and undulators is marked on a logarithmic scale; 
for comparison, the values for three everyday light sources are also 
indicated. The formal measure of source quality is called brilliance, 
and is defined by 


Number of photons /second 


Brilliance = 22-2 
(mrad)* (mm? source area) (0.1% bandwidth) 


where the denominator takes into account the cross-sectional area 
from which the emissions occur, the solid angular divergence of the 
beam and the spectral distribution of the photons. Although undu- 
lators have opened up avenues of experimental X-ray science that 
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were inconceivable until the late 20th century, the enormous level of 
flux is not without its challenges: the primary optical components, 
such as monochromators, need to be cooled cryogenically and organic 
samples suffer radiation damage rapidly. 


3.5 Reactors and pulsed neutron sources 


Outside a nucleus, a neutron decays with a half-life of about 15 min- 
utes into a proton and an electron (and an anti-neutrino): 


n —> p+e AP Vex 


While this process can occur within an atom, leading to the ejection 
of the electron as 3-radiation and the transmutation of the element, 
the rate is usually much slower. This stability means that atomic 
nuclei are ultimately the source of neutrons. The question is how to 
get them out? 

Neutrons were discovered by Chadwick, in 1932, following the ir- 
radiation of beryllium with a-particles from polonium (7!°Po). The 
main nuclear reaction involved is 


a? Be = "C44, 


where a-particles are just *He nuclei. The emission rates from such 
(a,n) processes are typically 10° to 10° neutrons per second, which 
is too low for most experimental work. A better source of neutrons is 
provided by fission, whereby a heavy nucleus splits into two lighter 
ones and generates some residual y-rays, neutrons and other sub- 
atomic particles. Uranium enriched with its less common isotope, 
2350 is the principal raw material. After the initial absorption of a 
stray neutron, created by an occasional spontaneous fission or the 
interaction between a cosmic ray and matter, the resultant ?°°U is 
unstable and breaks up in a number of different ways. One possible 
mechanism is 


n + 5U = 2367] * = 134Xe + 1009r + Jii; 


but the alternatives combine to release an average of 2.5 neutrons 
per event. The net surplus forms the basis of a self-sustaining chain 
reaction, where some of the ejected neutrons induce the fission of 
other 7°°U atoms, that allows research reactors to yield a neutron 
flux of up to 1015 s7tem~?. 

Uranium undergoes fission with bombardment by slow neutrons, 
but those liberated in the nuclear reaction are too fast by a factor of 
over a thousand. They are made to lose kinetic energy through colli- 
sions with light atoms, since the exchange is most efficient between 
particles of similar mass, to create the right conditions for a chain 
reaction. The resulting distribution of neutron speeds approaches 
the Maxwell—Boltzmann function of Fig. 1.13 corresponding to the 
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temperature of the moderating material; the latter, usually graphite 
or heavy water (D20), is kept cool at T ~ 300K with an appropriate 
heat transfer system. Luckily, the thermal energy neutrons that are 
best suited to encourage fission are also optimal for use in the study 
of condensed matter at the atomic and molecular level. If either 
longer or shorter wavelengths are required preferentially, they can 
be obtained through further moderation outside the reactor core at 
the desired temperature; not surprisingly, the outputs are known as 
hot and cold sources. 


Elastic collisions and moderation 


The moderation process, where a fast neutron is slowed down by an elas- 
tic collision with a slower nucleus, can be understood through a simple 
example from billiard ball dynamics: a particle of mass m and speed u 
strikes one that is stationary and of mass M head on; after the collision, 
their velocities are v and V, respectively, as shown. 


u v us 
Before After 


The relationship between u and v can be ascertained in terms of m and M 
by applying the principles of the conservation of momentum and kinetic 
energy. The former requires that 


mu = mo T MV, (3.49) 
whereas (twice) the latter implies 
mu? = mv’ + MV". (3.50) 


After substituting for V from eqn (3.49) in (3.50), the resultant quadratic 
equation for the ratio v/u can be solved to give 


a 


w m M 


(3.51) 


The alternative result of u = v, and V =0, is not physically meaningful 
as m would then need to pass through M! The ratio of v/u is negative 
if m< M, meaning that m bounces backwards, while both particles move 
forward when m > M. The speed of the incident particle is smallest after 
the encounter, in fact zero, when m = M. Hence eqn (3.51) supports the 
assertion that the energy exchange in collisions is greatest for particles of 
similar mass, and explains why neutron moderators consist of light, if not 
hydrogenous, materials. 

The scenario above is called an elastic collision because kinetic energy 
is conserved in the impact. From the perspective of mass m, however, it is 
an inelastic scattering event since |u| Æ |v]. 
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An alternative source of neutrons, which has become increasingly 
important since the 1990s, is based on accelerator technology sim- 
ilar to that found at synchrotron facilities. In essence, a stream of 
high energy protons (~ 1 GeV) strikes a target made from a heavy 
element, such as tungsten or mercury, which results in neutrons 
and protons (and other subatomic byproducts) being knocked out of 
the nuclei in a process called spallation. Although this debris of 
ejected nucleons is too hot to be moderated effectively, less energetic 
neutrons ‘evaporate’ from the remnant nuclei when they relax from 
their excited states; these secondary emissions are more amenable 
to thermodynamic taming. 

Spallation sources have a big political advantage over nuclear re- 
actors, in that their construction is far less controversial. This is 
mainly because they don’t need fissile fuel, as it’s not a requirement 
of the target material. There is also no danger of a runaway chain 
reaction, or meltdown, as the spallation process stops when power 
is cut off to the accelerator. 


3.5.1 The time-of-flight technique 


Unlike conventional nuclear reactors, which have a continuous out- 
put, neutrons at spallation sources are produced in short bursts as 
focused bunches of protons are made to hit the target periodically. 
This is illustrated schematically in Fig. 3.18, where the contribu- 
tion from the under-moderated, or epithermal, neutrons has been 
omitted for clarity; this fast component would appear at very short 
times after each pulse, at integer values of r. The repetition rate, 
1/7, depends on the facility, but is typically between 10 and 60Hz. 
The lower value is better suited to the longer wavelength neutrons 
emerging from a cold moderator (20K rather than 300K, say), as this 
allows sufficient time between the pulses for these slower particles fee wet > 
to traverse the experimental apparatus. 

Scattering measurements at reactors and synchrotrons are gener- 
ally carried out with a beam containing particles of a single (known) 
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Fig. 3.18 A schematic illustration of the flux of thermal neutrons, from moderators 
at 300K and 50K, at a spallation source with a repetition rate of 1/7, as compared 
with the uniform temporal output of a conventional nuclear reactor. 
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wavelength. As noted in Section 3.13, this enables the angular de- 
flections (20, ¢) to be transformed into the corresponding wavevector 
transfer Q, with eqn (3.8), for elastic interactions. Such a monochro- 
mation is inefficient, however, since most of the incident flux is lost 
in the selection procedure. It is unnecessary at a spallation source, 
because the common starting point of all the neutrons, following the 
proton pulse, means that the wavelength of each can be determined 
from basic kinematics and eqn (1.18): 


h (t + to) 


ee ee 
My (L+ Lo)’ 


(3.52) 


where t is the time, measured from the target impact, at which an 
elastically scattered neutron is detected at a given flight path dis- 
tance L from the moderator; to and L, are small offsets that have to 
be calibrated. This is the basis of the time-of-flight technique. The 
number of deflections can be normalized against the spectral distri- 
bution of the incident neutrons, obtained in the same way by placing 
monitors in the multichromatic ‘white’ beam prior to the sample. 


Surfaces, interfaces and 
reflectivity 


For our first example of elastic scattering, let’s consider the case of 
specular reflectivity. This type of experiment is used in the study of 
layered materials, and is a good starting point because it simplifies 
to a one-dimensional problem. 


4.1 Reflectivity and Fourier transforms 


The geometrical setup for specular reflectivity is shown in Fig. 4.1: 
a beam of X-rays, or neutrons, of wavelength \ impinges at a graz- 
ing angle 0 on a planar sample, and the fraction that bounce back 
with the same attributes is ascertained. These measurements are 
repeated for different angles at a given wavelength, or the other 
way around with a multichromatic source, and yield the reflectivity 
curve R(Q), where 
4n sinl 


Q = 3 (4.1) 


and, by definition, 0 < R < 1. The central question is: How does the 
structural composition of the sample, as encapsulated by its three- 
dimensional SLD function /(r), relate to R(Q)? 

We should begin by emphasizing that our interest here is purely 
in layered samples. That is to say, to a good approximation, their 


Fig. 4.1 A schematic illustration of an X-ray or neutron, of wavelength à, being 
reflected specularly, at an angle 0, from the surface of a layered sample. 
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d?r = drdydz 
Qer = EQe2 t+ yQy + 2Qz 


Q =Q24+0;+Q; 


SLD is invariant in the x—y plane: 


Blz) for |x) <i, and L Lys 
ga) = [PO for lal lyl<Ly a 
0 otherwise, 


where the incident illumination, of the X-rays or neutrons, has been 
taken to be over a finite rectangular region, |x| < Ls and |y|<L,. In 
the limit of weak scattering and the Fraunhofer regime, as discussed 
in Section 3.3, the elastic differential cross-section is given by the 
Fourier transform of the SLD function: 


Bajora] 
V 


where V is the total volume within which scattering can occur. The 
substitution of 6 (r) from eqn (4.2), and the evaluation of the separa- 
ble integrals with respect to x and y along the lines of Section 2.5.2, 
leads to 


2 


do sin?(LzQz) sin? = pe el2Qz 
(a). x 16 aa [oe B(z dz| . (4.8) 


The Q, and Q, prefactor on the right-hand side is a product of sinc- 
squared functions, identical to that for k, and k, in Fig. 2.19(a) with 
D,=2L,z and D,=2L,. It has a maximum when 


Q:=0 and Q, =0, (4.4) 


of value 16 L? L2, and is concentrated in the narrow region 
T T 
Qel < z7 ad |Q,] < T, (4.5) 


For a sample consisting of planar layers, therefore, most of the in- 
cident beam is scattered close to the specular condition of eqn (4.4). 
In this limit, eqn (4.3) reduces to 


2 


do ; 
Sas 1 272 J 1zQ 4. 
( a). x 16L, Ly p(z) e dz | , (4.6) 


with Q, =-—Q given the geometry of Fig. 4.1. 

The dependence of R(Q) on ((z) is closely related to eqn (4.6). 
To see this, consider the definitions of the elastic differential cross- 
section and specular reflectivity. The former was given in Section 
3.1.2, 


do Number deflected by (20, ġ) per unit solid angle 
dQ Number of incident particles per unit area of beam ’ 
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with the interactions being implicitly taken as elastic, and the latter 
is simply 


Rate of specular reflective scattering 
R SS eS er 
Rate of incidence 


A comparison shows that the denominators are linked through the 
area of the sample illuminated perpendicular to the incoming beam, 
4 Ly Ly sin0, whereas the numerator of do/dQ has to be integrated 
over a small solid angle, AQ, in the neighbourhood of the specular 
condition to be equated with R: 


1 do 
R(Q) = 4L; Ly sind I (Sn). (4.7) 
AQ 


This double integral can be evaluated by exploiting the compact na- 
ture of the differential cross-section of eqn (4.3), by multiplying its 
limiting form in eqn (4.6) with the solid angle corresponding to the 
central portion of the region enclosed by eqn (4.5): 


do AQ do 

1 aa ed H 
[| (g8).2 4 x (a), 
AQ 


where, after a consideration of the kinematics of elastic scattering 
in the vicinity of specular reflection, it can be shown that 


Qe 0,Qy 0,Q: -Q 


16 7? sin 6 
AQ x —____.., (4.8) 
Laly Q" 


This approximation to the integral in eqn (4.7) leads to 


2 


2| % , 
RQ) x E | foe iea], (4.9) 


Q 


where the exact area of the illumination conveniently cancels out 
but has been assumed to be large compared to the wavelength of the 
radiation. 

The proportionality of eqn (4.9) should, in fact, be an equality. The 
former was introduced to compensate for the variety of conventions 
in use for the normalization of the differential cross-section, but it’s 
unnecessary with the scattering length density. The SLD is easiest 
to calculate for a uniform slab of a single element, especially for the 
case of neutron scattering by the nucleus. It’s just the product of the 
coherent scattering length, (b), and the number of atoms per unit 
volume, 7: 


B= nb): (4.10) 
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Q = ki—k; = (—e, —6,2k,—€) 


Iki] = VRE + RP = = 


<P 


kz 


kz 
ki 


The angular spread of specular reflections 


Although specular reflection is defined by Qs =0, Q, =0 and Q.=-Q, 
with reference to Fig. 4.1, the finite size of the illuminated sample means 
that there is appreciable scattering in the neighbourhood of this condition. 
Over what solid angle, AQ, does it spread? 

Taking the incident beam to be parallel to the x-axis, the incoming and 
outgoing wavevectors, k; and ky respectively, are given by 


ke = (Sr, 0.82) and ky = (ket+e,6,€—kz) , 


where non-zero values of <, ô and € characterize departure from the specu- 
lar condition. The requirement of elastic scattering, |k;|” =|k;|°, imposes 
the restriction 

Ee keer LE DE = A 


For small deviations, when the quadratic terms «°, 5” and £€° are negligi- 
ble, this implies € ~ € kz/kz. Hence, the unit vector along the scattered 
beam, k;, is a function of the independent variables «€ and ô: 


(kate, ð, [4] e—ke) 
Vike +k? 


AQ is then the area of the spherical shell generated by the vertex of k 
as « and ô take on their allowed range of values in the vicinity of «=0 and 
o=0: 

The extent of the local departure from specular reflection is given by 
the boundaries of the rectangular region in eqn (4.5). These limits on Qz 
and Q, are equivalent to € =+7/L, and 6=+7/Ly, respectively, and 
lead to an estimate of AQ through the area of the (small) parallelogram 
generated by the four corresponding ky (e€, ô) corners. Making use of the 
vector properties outlined in Section 2.1.1, and that of the cross product 
in particular, 


ky (e, 5) x 


aa m | [B50 -Eo [E0 =) -E0 


2 
T a (-#,0,1) 
Le Ly (kg +k?) = 


The evaluation of the modulus on the far right-hand side, and some alge- 
braic rearrangement, allows the solid angle to be written as 


4r? sin 6 
AQ & e 
Dalene 


where sinf = kz /v kg +k? from elementary trigonometry. Finally, the 
expression for AQ given in eqn (4.8) follows from the specular wavevector 
transfer Q- = 2k, = - Q. 
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Silicon, for example, which is often used as a substrate upon which 
layers of sample material are deposited, has (b) =4.15 fm, an atomic 
mass m = 28.1gmol—! and a density p = 2.33gcm~°; this yields an 
SLD of 


N z 
6 = maxi, 
mM 


where the Avogadro constant N, = 6.02 x10723 mol~! and the values 
of (b) and p were multiplied by 1075 and 107°4, respectively, to have 
consistent length units of Å. 

Rather than consisting of a single element, most layers are molec- 
ular. If the chemical formula indicates that there are K types of 
atoms, of which #; are of variety j with scattering length (b),, then 
the SLD is given by 


p Na 5 
j= D #; (d), |, (4.11) 


where m = ` #;m; is the mass of the molecule and p is the bulk 

density. For the oxide layer which builds up on exposed surfaces of 

silicon, SiOz, 

p Na 
m 


D= 


(Os: 42 (bo) — 35x10-8 A, 


where p = 2.20gcem~?, m = 60.1gmol~', (b); = 4.15 fm (as above) 
and (b), = 5.80fm. The equivalent formula for X-rays is the same 
as eqn (4.11), except that the (b); are replaced by a product of the 
atomic number, Zj, and the Thomson scattering length, re, which 
was met earlier in eqn (3.19): 


(0); > ZjTe |- (4.12) 
As such, 8 is proportional to the electron density. In the case of the 
silicon substrate and its oxide layer, the X-ray SLDs are 19.7 and 
18.6 x10-6 A” respectively. 

Although the Fourier relationship between the depth profile 3(z) 
and the reflectivity data R(Q) is apparent from eqn (4.9), it is often 
easier to make the connection with the physical insight gained in 
Section 2.5 by using it in an alternative form involving the deriva- 
tive d8/dz: 


(4.13) 


which follows from eqn (4.9) through ‘integration by parts’. The mild 
technical proviso in the derivation, that G—0 as z— +oo, must be 
fulfilled inherently if 3(z) is to satisfy the Dirichlet condition. 


aio? 2 
Neutron X-ray 
O O 
35:9 18.6 
Zell 19.7 
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2.1x1078 R7? 


z (A) 


4.1.1 Substrate only 


As a first example of eqn (4.13) in action, let’s consider the simplest 
possible situation: a bare substrate, with an SLD of 8.. For an ideal 
surface, the depth profile is a Heaviside function with a step discon- 
tinuity at the interface: 


E Bs for z<0, (4.14) 
A= 0 tor z>0. l 


It’s illustrated in Fig. 4.2 for the case of silicon with neutrons. Since 
the gradient of this (z) is zero everywhere except z= 0, the deriva- 
tive is a scaled 6-function, 

do 


Being extremely narrow, its Fourier transform is uniform: 


CO 


Bs [se eltQqz = Bs = Bs. 


— Co 


Hence, eqn (4.18) predicts a smooth fall off in reflectivity with the 
prefactor dependence of Q7?: 
16a" 6. 
Qt 
Such a gradual decay is evident in Fig. 4.2, although the trend does 
not hold close to the origin. This break down is inevitable because 
eqn (4.16) violates the physical constraint that R<1 as Q—0. We'll 
leave a discussion of the low-Q limitations of eqn (4.13) to Section 
4.2, and continue exploring how a basic knowledge of Fourier trans- 
forms can help us to understand reflectivity data. 


Equation (4.16) also predicts that the reflectivity will scale with 
the square of 3,. This dependence is verified by the vertical shift of 


R(Q) x (4.16) 


dß/dz R 


i > Q (4!) 
(0) 0.1 0.2 


Fig. 4.2 The SLD depth profile, 6(z), and its derivative, d8/dz, for a bare substrate with a ‘perfect’ surface; the choice of 8s 
corresponds to silicon for neutron scattering. The reflectivity data predicted by eqn (4.16), indicated by the dashed grey line, is 
in good agreement with the ‘true’ R(Q) for Q > 0.02 a"; 
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R(Q) in Fig. 4.3, which shows the situation when the SLD is about 
three times greater than that in Fig. 4.2. 

So far, we have considered substrates with very sharp interfaces. 
The transition of the SLD at a boundary tends to be diffuse in prac- 
tice, and an example is given with the dashed grey line in Fig. 4.3. 
This characteristic of the surface is often modelled by a Gaussian, 
so that the derivative 


d8 Bs a 

—=- — 4.1 

T . = exp( =) (4.17) 
instead of the 6-function of eqn (4.15); the latter is recovered in the 


limit of o — 0. The Fourier transform is also Gaussian, but of the 
reciprocal width 1/c: 


CO 


d i 2()2 
[pera = — bs ep(- 2E I 


— Co 


Substitution into eqn (4.13) predicts that the reflectivity will be that 
of eqn (4.16) modulated by a Gaussian with a standard deviation of 
(o v2)": 
292 

RQ) & a x exp(—02Q?). (4.18) 
The more rapid decay with increasing Q expected from a bare sub- 
strate with a ‘rough’ surface can be seen in Fig. 4.3. In fact, this is 
a general feature of all SLD functions that contain diffuse interfaces 
and is akin to the Debye—Waller factor of Section 3.3.2. It is easiest 
to understand in detail when all the layer boundaries have the same 
(or similar) level of roughness, c, through the use of the convolution 
theorem of Section 2.4.1, 


R(Q) © R(Q) x o”, (4.19) 


where Ro(Q) is the equivalent reflectivity curve for a sample with 
sharp interfaces. 


6 dp /dz R 


6.5x1078 R7? 


z (R) 


T T T “*s r> Q (87!) 
—80 —40 : 


0.1 0.2 


Fig. 4.3 The SLD depth profile, 6(z), its derivative, dG/dz, and the resultant reflectivity curve, R(Q), for a bare substrate with 
Gs; about three times greater than that in Fig. 4.2. The cases of both a sharp interface (blue) and a diffuse one (dashed grey), 
with a Gaussian roughness of o=12A, are shown. 
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4.1.2 One uniform layer 


The next simplest situation involves a single uniform layer, of thick- 
ness L and SLD 64, deposited on a substrate. The ideal depth profile 
has two step discontinuities, 


Bs for z<—-L, 
Biz) =< By -L<z<0, (4.20) 
0 for z>0, 


whose derivative is a pair of scaled 6-functions: 


ae 


= = (G1 — Bs) 5(2 +L) — £1 8(2) (4.21) 


Three cases are shown in Fig. 4.4, all with a 8, of 2.1 x10~° ae the 
solid blue line has L =200 A and fi =3.5 x10-° AY’. the dotted line 
in light blue has a layer of the same thickness but a higher value of 
Bı, whereas the dashed grey line has the same SLD as the first but 
is only half as wide. The integral property of a 6-function, given in 
eqn (2.54), leads to 


f Eei az = (0) dt? — py. 


The multiplication of the right-hand side with its complex conjugate, 
to obtain the modulus-squared of the Fourier transform of dG/dz, 
and the use of the trigonometric identity +2 + e1428 = 2cos(LQ), 
enables eqn (4.13) to be evaluated as 


lea? 
Qt 
where 6; and 8, have implicitly been taken as real. Superimposed on 


the familiar Q~* decay in the reflectivity, therefore, is a sinusoidal 
variation with a repeat @-distance of 


R(Q) = 8? + (61—Gs)” -20i Bs) cos(LQ)), 4.22) 


dp /dz R 


> QR) 
0.2 


Fig. 4.4 The SLD depth profile, 3(z), its derivative, dG/dz, and the resultant reflectivity curve, R(Q), for a single uniform layer 
deposited on a substrate. Three cases are shown, all with the same (3, and sharp interfaces: the one in dotted light blue has a 
higher value of 3; than the solid blue line, whereas the dashed grey one is narrower. 
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AQ = 2n/L. (4.23) 


This can seen in Fig. 4.4, where AQ is twice as large for the case of 
the narrower layer in the dashed grey line. 

The dependence of the amplitude, A=2(3;((3,—,), and the offset, 
B=6?7 + (Ai- oy, of the oscillations, B— Acos(LQ), on the magni- 
tude of 3, and its difference from p, can also be verified in Fig. 4.4: 
the visibility of the fringes, and the overall level of the reflectivity, 
is comparable for the solid blue and the dashed grey cases, but is 
noticeably greater for the dotted light blue one. As required theoret- 
ically, the expression of eqn (4.22) reduces to eqn (4.16) when either 
6,=0 or 0, = fs. 

The last thing to note is that the fringes start to become washed 
out as Q increases. This is not predicted by the analysis outlined 
here, but was encoded in the computer simulation of the data to 
mimic the blurring effect of a finite experimental resolution. This 
point-spread function was approximated by a Gaussian whose stan- 
dard deviation, c, varied linearly with Q; a fractional value of oc /Q = 
2% was used, although 3 to 5% is more typical in practice. Such a 
resolution blurring was also included in Figs. 4.2 and 4.3, but is im- 
perceptible because the reflectivity curves of eqns (4.16) and (4.18) 
do not contain any fine structure. Statistical noise, which limits the 
smallest reflectivity that can be measured reliably, was omitted to 
enhance clarity. 


4.1.3 A short multilayer 


The next example, shown in Fig. 4.5, involves three alternating lay- 
ers of two materials deposited on a substrate; for simplicity, all the 
thicknesses have been made the same. Rather than relating the cor- 
responding reflectivity curve to the SLD depth profile analytically, 
let’s try to understand it qualitatively with the help of the physical 
insight into Fourier transforms gained in Section 2.5. Since d3/dz 
for the multilayer is reminiscent of the aperture function, A(x), in 


f dB/dz R 


—600 —400 —200 


z (A) 
-600 -400 -200 


a— AQ — 


> Q (a!) 
0.2 


Fig. 4.5 The SLD depth profile, its derivative and the resultant reflectivity curve for a short multilayer deposited on a substrate. 
The reciprocal of the distance between the prominent peaks in R(Q), 27/AQ, gives the repeating distance of 100 A in 3(z), 


whereas the separation of the finer fringes indicates the overall thickness of the sample. 
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Fig. 2.17, at least when the minus sign of half the -function ampli- 
tudes is ignored, we might expect the modulus-squared of its Fourier 
transform to exhibit structure similar to the diffraction intensity, 
I(q), for a finite grating. This is indeed the case, with R(Q) showing 
a regular pattern of prominent peaks interspersed with a number 
of finer fringes (superimposed on the Q~* decay); the latter become 
washed out at higher Qs due to resolution blurring. The reciprocal 
of the spacing between the main peaks, 27/AQ, gives the repeat 
distance of 100A in G(z). The number of finer fringes and their spac- 
ing, as well as the width of the major peaks, is related to the overall 
thickness of the multilayer. 

The sort of intuitive Fourier reasoning used above can also be ap- 
plied to other situations. The pair of 5-functions defining d@/dz in 
Fig. 4.4, for example, resemble the aperture function in a Young’s 
double slit setup. By analogy with Fig. 2.14, therefore, we would 
expect the reflectivity curve to contain uniform cosine fringes whose 
spacing depends inversely on the thickness of the associated single 
layer. These types of argument demonstrate the power of the Fourier 
relationship of eqn (4.13) in helping us to understand the connec- 
tion between reflectivity measurements and the compositional depth 
structure of the sample. Before discussing its breakdown at low Q, 
let’s highlight the difficulty inherent in Fourier analysis when the 
complex phase is not available. 


4.1.4 A phaseless Fourier ambiguity 


Figure 4.6 shows two hypothetical samples of the same total thick- 
ness, consisting of one and two layers, deposited on a substrate with 
Bs =2.1 x10-* Å’. Even though the SLD depth profiles are distinctly 
different, the resultant reflectivity curves are virtually identical; the 
only slight discrepancy is at the first minimum. This ambiguity is 
an example of the information lost in diffraction experiments due 
to the inability of measuring the phase of the scattered wave: eqns 
(2.58), (8.32), (3.33) and (4.13) all involve a modulus-squared in the 
final step. 

The reason for the similarity of the reflectivity curves in Fig. 4.6 
can be understood by considering the auto-correlation functions of 
the relevant d8/dz. As discussed in Section 2.4.2, the ACF gives a 
real-space representation of the information contained in the inten- 
sity of a diffraction pattern: 


ACF(z) = OLGE dt = J EOQ, 


where F(Q) is the Fourier transform of f(z). In the present case, 
f(z)=q68/dz is purely real and comprises of two or three ô-functions. 
As such, it’s easiest to calculate the ACF by direct inspection. There 
is always a large positive peak at the origin because, at zero separa- 
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Fig. 4.6 The SLD depth profile, 3(z), of two hypothetical samples, plotted in solid blue and dotted grey, deposited on the 
same substrate, and the resultant reflectivity curves, R(Q). The derivatives of the profiles, dG/dz, and their auto-correlation 
functions, ACF, are also shown. 


tion, everything correlates with itself. The negative spikes at +200A 
in the ACF arise from the interference between the front and back 
of the sample, or the pair of 6-functions with amplitudes of opposite 
signs at z=0 and z = —200A in d8/dz. The extra components that 
might be expected in the ACF for the two-layer sample, at +100A, are 
absent because there is a complete cancellation between a positive 
and a negative contribution from two pairs of d-functions in d@/dz 
at that separation. This equality of the ACFs explains the similarity 
of the reflectivity curves. They would be identical if eqn (4.13) held 
exactly for all Q! 

The purpose of this simple example was to illustrate how difficult 
it is to infer uniquely the nature of an SLD function solely from a 
set of diffraction data. Since the problem of ambiguities will only 
be exacerbated with complexity in ((z), the reflectivity measure- 
ments must be supplemented with additional information. Perhaps 
the number of layers is known from the method of sample prepara- 
tion, or some of the theoretical alternatives do not satisfy physical 
or chemical constraints. In the case of Fig. 4.6, the two-layer model 
may be suspect due to the fortuitous symmetries in the SLD depth 
profile. A reliance on such prior knowledge, assumptions or beliefs 
can sometimes be avoided through clever experiments, but most of 
these require multiple sets of data connected by carefully controlled 
changes. 
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4.2 Reflectivity and geometrical optics 


Following the first reflectivity example of Fig. 4.2, it was noted that 
eqn (4.13) cannot hold as Q —0 because the physical constraint that 
R< 1 is violated. In fact, problems begin to surface with eqn (4.13) 
at low values of Q well before the divergence of the Q~* prefactor 
becomes an issue. This is illustrated in Fig. 4.7 with a simple set of 
reflectivity curves pertaining to a single uniform layer, of SLD 3 and 
thickness T, and no substrate (3, =0): the low-Q predictions of eqn 
(4.13) deteriorate as either |8| or T increases. What could be going 
wrong? 

The breakdown of eqn (4.13) at low Qs suggests that one, or more, 
of the simplifying assumptions leading to the Fourier relationship 
of eqn (3.33) is not appropriate in that regime. The most suspect of 
these is the Born approximation, whereby the perturbation of the in- 
cident wave by the scattering process is taken to be negligible. While 
a weak interaction per nucleus or atom is one prerequisite, so too is 
a ‘thin’ sample. With the grazing angle geometry of a reflectivity 
experiment, however, the straight through path length, 


Tee T L = T/sinð, 


can be rather long. We should not be too surprised, therefore, if 
the validity of eqn (4.13) becomes doubtful as @ decreases, especially 
when T is large or the layers consist of strong scatterers. Although 
the inverse dependence of L on sin@ explains why the preceding 
Fourier analysis might run into difficulties as Q 0 at a fixed wave- 
length, it offers no insight on its demise at longer As. The answer to 
that conundrum probably has to do with the reciprocal relationship 
between the momentum of an incident particle and its wavelength: 
the more energetic the projectile, the less it’s perturbed. 

To obtain an accurate description of the reflectivity curve at all 
values of Q, we must adopt a different approach to the problem. It’s 
sometimes called the dynamical theory of scattering, as opposed to 
the kinematical one of the previous section. 
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Fig. 4.7 The deviation of the kinematical approximation (dotted grey line) to the reflectivity curve (in solid blue), at low values 
of Q, for a uniform layer of SLD @ and thickness T (with 3; =0). In (a), 3=2.0 x 107° A? and T=200 A; (b) and (c) differ from 
this by having a larger SLD, of 6.0 x 1076 oe or being thicker, at T=500 A, respectively. 
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4.2.1 Refractive index 


The reflection of light from planar surfaces has been studied for cen- 
turies, and is intimately linked with refraction at the boundary: part 
of the intensity bounces off at the angle of incidence, 6;, while the 
rest is transmitted with a change in direction, to 6,. This accounts 
for the appearance of the kink in objects standing in a glass of water. 
For a given material below the air—matter interface, the ratio of the 
cosines of these angles is found to be a constant, 


cos 6; 


= (4.24) 
cos 0; 

called the refractive index. The value of n can depend on the incident 

wavelength, \;, which explains why white (polychromatic) light is 

split into a rainbow by a prism. 

Although the refractive law of eqn (4.24) is usually attributed to 
Snell (1621) or Descartes (1637), it is described in the earlier work of 
Ibn Sahl (984).' Likewise, a rudimentary form of Fermat’s principle 
of least time (1662) used to derive this observation theoretically can 
be found in the writings of Ibn al-Haytham (1021) or, even further 
back, in the ideas of Hero of Alexandria (60).' At its heart lies the 
fact that the speed of light varies with the medium through which 
it travels. In conjunction with Huygens’ wave postulate (1678), the 
refractive index is then seen to be a ratio of speeds or wavelengths 
as much as the cosines of angles: 


Ci Ar 
Ct At 


(4.25) 
where the frequency of the (electromagnetic) oscillations, v, remains 
unchanged across the boundary. Let’s see how the refractive index 


and the mathematics of one-dimensional waves yield the reflectivity 
curve for a bare substrate that is valid for all Q. 


4.2.2 Asingle boundary 


Perpendicular to the planar interfacial surface (z = 0), the incident 
wave takes the form 


W;(z) = Wo die (4.26) 


where k;=27/\ and 0;=0 for consistency with Fig. 4.1. Part of this 
signal is reflected at the boundary, and travels in the +z direction 
with a complex amplitude r, 


w,(z) =r ol zhi ae (4.27) 
while the rest, quantified by t, is transmitted: 
palz) = t eiz kt sin 6; , (4.28) 


t Kwan et al. (2002), 


Rashed (1990), Wolf (1995). 


t Mihas (2005) 


C= PÀ 
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Fig. 4.8 The net displacement of a taut string, 7, when an incident wave, p;, en- 
counters a sudden change in the mass per unit length. A portion, pr, of the incoming 
signal is reflected at the boundary (at z =0) while the rest, 7+, is transmitted. 


The wavenumber k; and the sine of 0; are related to the incoming 
values through the refractive index n: 
2 1— sin? 
ba oe te E 
A n? 
The situation is like that of a wave on a taut string when it encoun- 
ters a sudden change in the mass per unit length, and is illustrated 
in Fig. 4.8. Drawing on this analogy, constraints on r and t can be 
obtained by imposing the boundary conditions that both the fluctu- 
ation, 7, and its derivative, dz/dz, must match up at the transition. 
That is to say, at z=0, 
d 

i+ Yr =Y and = h +W) = ore . (4,29) 
Substitution from eqns (4.26)-(4.28) then leads to the simultaneous 
equations 


Wotr=t and (Yo — r) kisinb = t kisin 6, 
which can easily be solved to yield 


2 Po 


t = d 
E an r 


(; — 2) Wo (4.30) 


where 


k; sin 0 sin 0 een 


. 1/2 
_ ksin Žž n ; (l- = 
If the term in square brackets is negative, a will be an imaginary 
number and the reflectivity, 
2 2 
1-2R 
poe A, (4.32) 
1+2Re {a} + lal 
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will be unity Gust as for a=0). As this requires 
sin?ð < 1—n?, 


complete reflection occurs from materials with n < 1 for glancing 
angles below a critical value of 


6. = sin™t (v 1—n? ) . (4.33) 


The formulae for |r/Y|? and |t/po|" resulting from eqns (4.30) and 
(4.31) are known as Fresnel equations. 
Having seen how the low-Q behaviour of the reflectivity curve is 
correctly predicted by this formulation, let’s make sure that its decay 
agrees with eqn (4.16) at large value of Q. In fact, this necessary 
condition will enable us to work out the relationship between the 
SLD, 3, the wavelength, à, and the refractive index, n. Rewriting 
eqn (4.31) for a in terms of Q with eqn (4.1), and using the binomial 
expansion for the square root, 
2 2 
Pe ei ae ee ee ee 
A? Q? 2 
which becomes ever more accurate as ()— oo. Substitution into eqn 
(4.32), with 1+a ~ 2, leads to 


a 16r? paa 
~ or a | 


E l-a 
~ lita 


A comparison with eqn (4.16) shows that the term in square brackets 
must be equated with £, so that 


(4.34) 


In conjunction with eqns (4.1) and (4.33), therefore, 6. can be trans- 
lated into an equivalent critical Q, 


Qe = 478, (4.35) 


below which R=1 for 8 > 0; complete reflection cannot occur if the 
SLD is negative. Finally, the square root of eqn (4.34) with the bino- 
mial expansion yields 


n x 1- — |, (4.36) 


which is the dependence of the refractive index on the wavelength 
and scattering-length density derived by Fermi (1950) through a 
consideration of the signal transmitted by a planar slab of uniform 
material. 
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BN+1 = s and fo =0 


polz) = Pr(z) + vil) 


Pulz) = plz) 


4.2.3 Multiple boundaries 


The analysis for the reflection at a single boundary can be general- 
ized to the case of multiple transitions in the refractive index. The 
situation for N layers is illustrated schematically in Fig. 4.9, with 
the SLD depth profile being defined by 
Bs for z<zy, 
Bliz) = < Ge forz; <z <z, (4.37) 
0 tor z>0, 


where j =1,2,3,..., N and zo =0. The displacement w of the oscilla- 
tion in the uniform region j can always be expressed as 


Wlej = Ajel** + Bye 32% , (4.38) 


where A; and B; are the (complex) amplitudes of waves travelling 
in the positive and negative z directions, ~;, and yj- respectively, 
with a wavenumber 


2T 271; 1— sin?6 
k; = (—)siné; = | — fe 
j (=) SIn yj ( X ) n? ’ 


where à and @ are the wavelength and glancing angle of the incident 
beam, and n; is the refractive index of the jth layer. A rearrange- 
ment of the right-hand side, including the substitution for n ? from 
eqn (4.34) and sin@/\ from eqn (4.1), allows the relevant wavenum- 


ber to be written as 
2K; = 4/Q2—1678; . (4.39) 


Given the transmission condition that 


Anyi = 0, (4.40) 


Fig. 4.9 The oscillatory displacement w% in the uniform region j, ~;(z), can be ex- 
pressed as the sum of a forward- and a backward-moving wave, Yj} and #;—, with 
a wavenumber, kj, determined by the incident wavelength, à, the glancing angle, 0, 
and the refractive index, nj. 


4.2 Reflectivity and geometrical optics 


because there is no reverse wave in the substrate, our task is to 
match y)(z) and its derivative, ~'(z), at each of the refractive index 
boundaries, 


pisilj) = plz) and 544 (25) = vj (zs) (4.41) 


for j = 0,1,2,...,N, with the aim of ascertaining | Aj/ Bol’ to obtain 
the reflectivity. 

Since eqn (4.38) and its derivative relate 7); (z) and 7);(z), between 
Zj and Zj—1> to Aj and Bj; 


the reflectivity is given by 
ro = ae) 
and eqn (4.40) implies that 
(zw) = —iks (zw). (4.43) 


Starting from an assignment of y and 7’ at z=z, which is consis- 
tent with eqn (4.43), such as 


wizn)=1 and 7’ (zy) = —ik,, 


the wave of eqn (4.38) and its derivative can be propagated through 
the Nth uniform layer, of thickness Ty, to the other boundary at 
Z = Zn_1: 


sin(k;T}) 


; w'(zj), (4.44) 
j 


p(zj-1) = cos(kj 15) (zj) + 
b'(2j-1) = — ky sin(k;Tj) b(z;) + cos(kjTj) b"(z;), (4.45) 


with 7=N. The continuity conditions of eqn (4.41) mean that eqns 
(4.44) and (4.45) can be used recursively until y and w)’ are evalu- 
ated at the surface of the sample, at z = zo = 0, whence eqn (4.42) 
yields the reflectivity. 

This type of iterative procedure for calculating R(Q) was first de- 
scribed by Abeles (1948) and Parratt (1954). Although it gives an 
accurate prediction of the reflectivity for all values of Q, the results 
lack the physical intuition allowed by the Fourier analysis of the 
kinematic approximation. Before moving on to the next example of 
elastic scattering, in Chapter 5, let us briefly compare the strengths 
and weaknesses of X-rays and neutrons; we'll also touch upon alter- 
native techniques for surface studies. 


w= 
ko = Q/2 
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From eqn (4.45), 
b'(zt+dz) = p'(z)—k*(z) p(z) dz 
No 


dy’ 
taz 


TW. A. Hamilton (1989), 
private communication. 


ce 


dk 
2 k(z) TAN An a 


The kinematic approximation to dynamical reflectivity 

For an arbitrary SLD profile, 6(z), described by a huge number (N — co) 
of extremely thin layers (of width dz), the problem becomes one of solving 
the second-order differential equation 


2 2 
So + [E —ana(2)}v~@ = 0, (4.46) 


subject to pure transmission within the substrate (z < zs), to evaluate eqn 
(4.42) for the reflectivity. While an analytical solution for y(z) is difficult 
to obtain in general, progress can be made when Q? > 167 (z); it leads 
to the kinematical formulae of Section 4.1. A derivation of eqn (4.13) is 
outlined below.' 

Consider the function 


re) — ERle) le) + ¥'(@) 


ik(z) (z) — o"(z) ` 


where k?(z) = Q?/4—47 G(z), so that the reflectivity of eqn (4.42) is equal 
to |r(0)|?. In conjunction with eqn (4.46), and a fair amount of effort, it 
can be shown to satisfy the first-order differential equation 

dr w dO ( 2 1) 


— —i2k = al 
dz ar kalz) dz j 


This becomes tractable in the limit of large Q and low reflectance, when 
k(z)— Q/2 and |r| <1, because it can then be solved with the aid of an 
integrating factor: 


Integrating both sides with respect to z from within the substrate, where 
r=0, to the surface of the sample, at z=0, 


0 
r 0 7 
[re] = ;0) & ee J de ize dz. 


The kinematic formula of eqn (4.13) follows from the observation that the 
upper limit of the integral can be set to +00 since d8/dz =0 for z>0. 


4.3 X-rays, neutrons and other techniques 


The differences between X-rays and neutrons in terms of reflectivity 
measurements are much the same as for all scattering experiments, 
and follow from the properties of these probes and their production 
facilities. Due to their accessibility, laboratory-based X-ray sources 
are usually the first port-of-call; reflectivities as low as 107°, and Qs 
as high as an A~!, are achievable under favourable conditions. By 
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Neutron 


> Q(a"') 


Fig. 4.10 The X-ray and neutron SLD depth profiles, 3(z), and the corresponding reflectivity curves, R(Q), for the same sample. 
While the former has a better signal-to-noise ratio, and is reliable out to a much higher value of Q, the lack of contrast in the 
X-ray SLD gives rise to uniform fringes that only indicate the overall thickness of the sample (160A). This length-scale can also 
be seen in the poorer quality neutron data, but the additional broader variation marks the presence of the (50A) subdivision. 


comparison, it’s difficult to go below R~10~°, or beyond Q = 0.2Å7*, 
with neutrons because of the combination of a poorer incident flux 
and a greater background signal from incoherent scattering. These 
drawbacks can be offset by the increased contrast in the SLD depth 
profile for neutrons, leading to a richer structure in the data, since 
the scattering lengths do not vary monotonically with atomic num- 
ber. Indeed, the ability to alter 3(z) through isotopic substitution, 
or with the polarization of the incident beam for magnetic samples, 
makes neutrons a versatile probe. 

A complementary technique for ascertaining the refractive index 
depth profile is provided by ellipsometry, whereby the reflectivities 
parallel and perpendicular to an incident plane-polarized light beam 
are analyzed simultaneously. Atomic-scale detail can be inferred 
despite the much longer wavelengths, but thin samples are trouble- 
some. Information can be obtained in a more direct fashion with 
the bombardment of the sample with energetic ions, typically hy- 
drogen or helium of several MeV, and an examination of the ejected 
‘debris’; such an ion beam analysis tends to have a depth resolution 
of only 10 to 20nm, however, and is restricted to work with solids. 
The measurement of the surface tension at a liquid surface can also 
be useful. 

The main assumption made in this chapter was that of a layered 
sample which is invariant in the x—y plane; this reduced the analy- 
sis to a one-dimensional problem. If the approximation of eqn (4.2) 
is not adequate, a significant fraction of the scattered intensity will 
lie outside the narrow region defined by eqn (4.5). In terms of Fig. 
4.1, there will be a substantial signal even when the angle of inci- 
dence, 0;, is not equal to that of reflection, 0», or when they are not 
in the same plane. The off-specular pattern tends not to be highly 
structured, however, due to the lack of a regular repeating pattern 
in the z—y domain. Nevertheless, the spread of the diffuse intensity 
gives an indication of the characteristic length scale(s) in the plane 
perpendicular to z via their reciprocal (Fourier) relationship. If the 
sample is solid and the x —y correlations are down to features on 
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Fig. 4.11 A grey-scale image of the reflectivity, as a function of the neutron wave- 
length à and reflection angle 0,, from a monolayer of two phospholipid components 
on top of a water substrate (Hughes et al., 2007). The blue lines show the correspond- 
ing one-dimensional data along the ‘specular ridge’, 6, = 0; (the angle of incidence), 
and at a constant A x 4.5À. 


the exposed surface (at z ~ 0), the origin of the off-specular signal 
can be explored with atomic force microscopy (AFM). Brewster-angle 
microscopy can help with the study of inhomogeneities on a liquid 
surface, but the spatial resolution is far from being atomic (closer to 
a um than an A). 


Small-angle scattering and the 
big picture 


Many biological and physical processes are controlled by the gross 
characteristics of large molecules and aggregates. Information about 
the size, shape and assemblies of these entities is obtained with 
small-angle scattering. 


5.1 Diffraction and length scales 


In Section 3.3, we learnt that the differential cross-section for elastic 
scattering, (do/d?).,, is related to the SLD function, ((r), through 
its Fourier transform: 


(35), > JIJ pea dir i 6.1) 


where V is the volume of the sample within which the scattering 
can occur. One of the most basic properties of Fourier transforms is 
the inverse relationship between length scales in real, r = (2, y, z), 
and reciprocal, Q = (Qz, Qy, Qz), space. If the size of a compact 
object is defined by the three r-lengths d,, dy and d,, for example, 
then the magnitude of its Fourier transform will decay towards zero 
beyond a Q-distance of around 27/d,, 27/d, and 27/d, from the ori- 
gin. Conversely, the lack of (reliable) scattering measurements for 
|Q|-values larger than Qmax limits the level of the fine detail that 
can be inferred about the sample to of order 27/Qmax. Experiments 
focusing on |Q| <1nm~! provide low-resolution structural informa- 
tion and are, therefore, good for seeing the ‘big picture’: the overall 
configuration and assemblies of macromolecules, and other objects 
of a similar size. The scope and context of the technique is summa- 
rized in Fig. 5.1. 

The relationship between the wavelength of the incident parti- 
cle, the scattering angle and the modulus of the elastic wavevector 
transfer was given in Section 3.1.1 as 


4r sinô 
U= T sin (5.2) 
À 
Thus low-Q experiments benefit from large As and entail measure- 


ments at small 0s. The latter, achieved by having a long secondary 


[Fourier Transform|* 
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Fig. 5.1 An overview, following Pynn (1990), of how objects spanning length scales 
from 10710 to 1074m can be probed by a variety of techniques. Small-angle X-ray 
and neutron scattering experiments are suited to low resolution structural studies of 
macromolecular complexes, porous materials and microscopic aggregates. 


flight-path, as indicated in Fig. 5.2, gives rise to the general name 
of Small-Angle Scattering; it’s often abbreviated to SAXS and SANS 
for X-rays and neutrons, respectively. By contrast, diffraction exper- 
iments requiring atomic resolution need data out to higher Qs and, 
hence, wider angles. 


5.2 Size, shape and molecular form factors 


Most small-angle scattering experiments are carried out on dilute 
solutions, so that the sample consists of isolated and isotropically 
oriented examples of the object of interest located randomly within 
a sea of solvent. If all N specimens are identical apart from their 
position, r;, and orientation, ©;, where j =1,2,3,...,N, then (G(r) 
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Detector 


Fig. 5.2 The setup for a low-Q experiment requires a long sample-to-detector distance, Lsp, encased in an evacuated tank to 
avoid losses due to scattering by air molecules. The elastic wavevector transfer, Q, is related to à, 0 and ¢ through eqn (8.8), 


with a signal at (x, y) corresponding to tan¢@ = y/x and tan 20 = \/x2+y?/Lgp. 
can be expressed as 
N P 
B(r) = bo +X belr-r;, 5) , (5.3) 
where 6, is the SLD of the solvent and 


(5.4) 


A Be(r, ©) — 6. for r inside solute , 
B-(r,O) = 


0 for r in the solvent , 


where e(r, ©) is a compact SLD function which characterizes the 
entity being studied (centred at the origin and with orientation ©); 
it is illustrated schematically in Fig. 5.3. For the simple case of a 
spherical complex, which has no special directions, with a constant 
SLD of @; and radius R, 


Bı for |r|<R, 


0 otherwise. 


bcr, ©) = i (5.5) 
Since the Fourier transform of an invariant is a 6-function, and those 
of shifted functions are related through a phase factor, as listed in 
Section 2.7, eqns (5.1) and (5.3) lead to 


2 
do 3 Ti er; a +) iQer 
(aa), eo [Jfa ar, 


where Ve is the volume of space encapsulated by a solute structure 
that is centred at the origin with orientation ©. When the summa- 
tion is expanded and the modulus-squared evaluated, the part that 
adds up coherently, for uncorrelated positions r; and r;, is 


d N p3 ; 2 
(aa) p? JJ [P Oe" dr] a (5.6) 
a) Sa J 


Fig. 5.3 A schematic illustration of eqn 
(5.3), with grey indicating solvent. 
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Fig. 5.4 The spherical Bessel function 
y =ji(x) and, dotted, y = 3j1(x)/z. 


where the sharply peaked signal at Q=0 has been omitted. The lat- 
ter coincides with the intense, unscattered, straight-through beam, 
which would damage sensitive measuring equipment placed in its 
path; this is the reason for the hole (or beamstop) at the centre of 
the position-sensitive detector in Fig. 5.2. With an isotropic distri- 
bution of the ©, eqn (5.6) reduces to 

|) (5.7) 

© 


where the angled brackets represent an orientational average, of the 
Fourier transform of 3.(r,©), and N is proportional to the concen- 
tration of the solution. 

The formula of eqn (5.7) simplifies further when the solute parti- 
cles are spherically symmetric; that is to say, 


= (finn 


B.(r,@) = Bo(r) 


where r = |r|. Not only does the orientational average then become 
redundant, but the Fourier transform reduces to a one-dimensional 
scalar integral: 


R 
do ~, sin(Qr) 
(a). X N (47)? i? DAT) “Qr dr 5 (5.8) 
0 


where R is the radius of the object of interest. For the case of eqn 


(5.5), when 6.(r)= 61—Bo, 


R 


2 
(5). x N = po | fr sin(Qr)dr| . (5.9) 


0 


The integral, which can be done ‘by parts’, is related to a jı spherical 
Bessel function, plotted in Fig. 5.4, 


R 
f sin(Qr) dr = — = R’j\(QR), (5.10) 
0 


so that eqn (5.9) yields 


d 3 71(QR) |? 
Gk N [V | 61 — Bo = (5.11) 


where V,= 47R°/3 is the volume of the solute particle. 
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Fourier transforms in polar coordinates 


The evaluation of a Fourier transform, or any multiple integral, becomes 
easier when conducted in a coordinate system that matches the symmetry 
of the problem. For the three-dimensional case, defined formally in Carte- 
sians in eqns (2.64) and (2.65), it makes most sense to work in spherical 
polars when f(r) =f(r). Denoting the angle between the vectors k and r by 
q), so that ker =k r cos y with 0< y <r radians, and that of the projection 
of r in the plane perpendicular to k by 0< x < 27, the Fourier transform 
of f(r) becomes 


T 


Is 20 
BR) x toe f(r) ar [sinw y fax ; 
0 0 


0 


with M =3 in eqn (2.65), where the volume element d?r =r? sin y dr dw dy 
and f(r) = 0 for r> R. Carrying out the x integral first, which gives 27, 
and then the one for w, 


R r et Er cos Y m 
F(k 2 i — 
(le) ex rfr (in) eee a 
0 


and recognizing that e!”’— e1*" = 2isin(kr), from eqn (2.32), upon sub- 


stitution of the y limits, yields 


R 
[ff elker gp = an fr? f(r) oe dG (5.12) 


0 


In the two-dimensional circular case, which is also relevant for cylindri- 
cal symmetry, only the angle y is needed. While ker is still k r cosy, with 
0<w< 27 to cover all directions, the element of area d?r =rdrdy. The 
corresponding w integral is not so straightforward without the prefactor 
of siny to accompany the cosy in the exponential but, to within 27, is a 
Jo Bessel function: 


27 
ener dy = 27 Jo(kr) . (5.13) 
0 
Jo(x) is similar to a sin z/x sinc function, with the latter also being known 
as a jo(x) spherical Bessel function; both are plotted in Fig. 5.5. The two- 


dimensional form of a Fourier transform in plane polar coordinates then 
follows readily, 


R 
[fo ker g?r = 2r fr f(r) Jo(kr) dr, (5.14) 


0 


and is illustrated in Fig. 2.19 for the case of f(r) =1 for r< R, and zero 
otherwise. 


Fig. 5.5 The y = Jo(x) Bessel function 
and, dotted, the sinx/x sinc function. 
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J|[ DRAA a 


= f(Q) 8 R(Q) 
if R(q,Q) = R(Q-q) 


5.2.1 Absolute intensities 


The intensity of the measured elastic signal, I« (Q), is related to the 
differential cross-section by 


do 


lQ) = LAA T(A) AQ (=) @R(Q)+BQ), 615) 


where I», 7 and T, which may depend on wavelength, denote the in- 
cident flux, efficiency of the detectors and the transmission function 
of the sample, respectively; AQ is the solid angle subtended by the 
detector-element, corresponding to a wavevector transfer of Q, at 
the sample position. B(Q) represents a slowly varying background 
and R(Q) the instrumental resolution function. Strictly speaking 
the blurring effect of the latter can only be written as a convolution 
(@) for an invariant response, whereas the width of R(Q) generally 
increases with |Q|, but we'll ignore this technicality for simplicity of 
notation. 

Although T(,) will depend on the overall composition of the sam- 
ple, as might some aspects of B(Q), the details of the structure are 
encoded in the differential cross-section. At the origin, eqn (5.7) pre- 


dicts that 
do ~ 5 
(5) x (| [[[aeores (5.16) 
dQ). 
Q=0 Ve (S) 


Taking Be(r, ©) to be uniform within the volume defined by Vo, and 
equal to 3,—(, so that 


J| [EEr = (B-b) f| far, 
Və Vo 


and recognizing the integral on the right-hand side as the volume of 
the solute particle, Vp, irrespective of the orientation ©, 


(5.17) 


The SLDs 6, and (; are given by eqn (4.11), with a summation over 
the atoms of the solvent and solute respectively. 

The equations for the differential cross-section were written with 
a proportionality to allow for the variety of normalizations in use. 
The most common in small-angle scattering studies is ‘per unit scat- 
tering volume’, V, whence 


do 
dQ), 


N 
7 S) Jo Bo” (5.18) 
Q=0 
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If the concentration of the solution is defined dimensionlessly by the 
volume fraction, ®, then the number of solute particles is 
ees, 
Vp 
Their individual volumes, V,, can be related to macroscopic quanti- 
ties, such as the density of the solute, p, and its molecular mass, m, 
through the Avogadro constant, N,: 
m 
V, = —, 
P p N, 
where the units of V, are cm? when p is given in gem~? and m in 


gmol`t!. If 6, and 8, are specified in cm~? (or 10716 Å-?), then the 
differential cross-section, 


; (5.19) 


will be in units of cm~!. 


5.2.2 The Guinier approximation 


The relationship between the value of the differential cross-section 
at Q=0 and the density, molecular mass, concentration and SLD 
(relative to 8o) of the solute, quantified by eqn (5.19), provides a 
useful constraint on structural models for the sample. The problem, 
however, is that the intensity of the scattered signal at the origin, 
Ia(0), is not measured since it coincides with the ‘hole’ at the centre 
of the detector (Fig. 5.2). Even without this practical shortcoming, 
there is a fundamental difficulty with eqn (5.19): it does not hold at 
Q =0 because it is based on eqn (5.6) in which the 6(Q) component 
was omitted! It does apply asymptotically as Q — 0, and an analysis 
of its behaviour as the origin is approached sheds light on the size 
of the solute particles. 

The first step in the derivation of eqn (5.19) involved the substi- 
tution of unity for the exponential term in eqn (5.7). If Q40, but is 
small, then a Taylor series expansion about the origin gives 

der — 1 +iQert+ = (iQer)? Fasty 
where higher-order terms become increasingly negligible as |Q|— 0. 
Thus eqn (5.7) can be expressed as 
2 
(5.20) 
e 


(aa), > z © f[[Re o See a 


with eqn (5.16) being the case when only the zeroth-order term con- 
tributes to the summation. In the limit of small Q, we might expect 
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the linear term to give rise to the main deviation from eqn (5.19). 
This is not so, however, as 


Je» B.(r,@) dr = 0 
Ve 


if Be(r, ©) = B.(—r, ©). For solute particles with centric symmetry, 
therefore, the quadratic factor, 


-3 | [fran B(x, @) dr, 
Və 


dominates the drop-off in scattered intensity away from the origin. 
Taking (.(r,©) = 8ı— bo within Vo, as earlier, eqn (5.20) is approx- 
imated well by 
) 
© 


do\ | (N 2 1 2 
(in), Ee aai- ay front er 
Vo 


in the neighbourhood of Q=0. Expanding the modulus-squared and 
using the linearity of the averaging, 


do\ _ (do _ S 2 43 
aaler] on 
= A o 


where only the terms with the lowest order Q-variation have been 
retained. Denoting the angle between the vectors Q and r by y, 
so that Q.r = Qrcosy, the value of y (for a given Q and a fixed 
point in the solute particle) will vary between 0 and 7 radians as 
the orientation © changes; the angle of the projection of r in the 
plane perpendicular to Q, x say, will correspondingly vary between 
0 and 27. Hence, the orientational average, over a solid angle of 4r 
steradians, is given by 


(Joe vr] = fo fon ab [ff arcosy? ar, 
Ve 7 Pi 


(S) 


where siny dy dy is the element of solid angle generated by in- 
finitesimally small changes in 7 and y. Reversing the order of the 
integrations, 


E [fee f anv a fa = e 
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where the integral of r? over the volume of the solute particle is 


independent of orientation ©. By analogy with classical mechanics, 
the latter is related to a radius of gyration, R,, through 


nS zJ” d°r , (5.22) 
(pere) = 


© 

In conjunction with eqns (5.19) and (5.21), therefore, the logarithm 
of the differential cross-section in the neighbourhood of the origin is 
given by 


so that 


l (5.23) 


d ® R: 
S) | ~ log, bee [AB | = (=) Q? 


where we have used the first-order term in the Taylor series expan- 
sion of log,(1—X) to obtain the second contribution on the right- 
hand side. Thus a plot of log,[do/dQ] versus Q? in the range 
0< QSR; 1 should be approximated well by a straight line with 
a negative slope of Re /3 and an intercept related to the molecular 
mass, concentration and density of the sample.’ 


The radius of gyration in classical mechanics 


The resistance offered by an object to a change in its linear motion, called 
inertia, is determined entirely by its mass, M; the size and shape of the 
entity have no bearing on the issue (in vacuo). The latter become impor- 
tant for rotational motion, however, where the difficulty experienced in 
trying to change the angular speed is determined by the mass distribu- 
tion of the object with respect to the axis of rotation. This is encapsulated 


in the moment of inertia, 
I = [f fro ae, 
V 


where p is the density within volume V and r, is the perpendicular dis- 
tance of position r from the axis of rotation. If the density is uniform, so 
that p(r)=M/V, then I = MRŻ where 


Ro = = fff r, 
V 


The radius of gyration, Rg, is then the distance that a point mass M would 
have to be from the axis of rotation to have the same moment of inertia as 
the given (extended) object. 


log (XY) = log X + log Y 


2 
ip eae ~ =e 
t Guinier (1939) 
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Although the R, of eqn (5.22) is called a ‘radius of gyration’, the 
expression is not quite the same as the one in classical mechanics. 
This technicality aside, it does give a measure of size in terms of an 
rms radius. The relationship between R, and the physical dimen- 
sions of the solute particles is simplest to work out for a sphere of 


radius R: 
1 R T QT 
> a 4 : 
Re = pops |" ar [sino ao fag, 
(0) (0) (0) 


where we have used the fact that d?r = r? sin 0 dr dô dọ in spherical 


polar coordinates, so that 
3 
m= E Ra (5.24) 


The resultant fall-off in the the differential cross-section given by 
eqns (5.23) and (5.24) can be verified by examining the behaviour of 
eqn (5.11) as Q—0; that is, by putting 


g g E 


sing % ar | and cosx & l->+3 
in eqn (5.10), where x = QR, and so on. 
For a rugby ball-shaped ellipsoid, with principal axes of half-length 
a, a and b, it can be shown that 


9 2 2 
R, = j= l (5.25) 


As required, it reduces to the spherical case of eqn (5.24) when 
a = b = R. A long cylindrical particle can be modelled with b > a, 
giving R ~ b/\/5, whereas a > b represents a disc-like object, hav- 
ing an R, © a /2/5 . Since the radius of gyration is dominated by 
the largest length scale, the Guinier region will be experimentally 
inaccessible if the solute particles are too extended in some sense, so 
that Ao > Q- 


min’ 


5.2.3 Porod’s law 


The Guinier relationship of eqn (5.23) quantifies the fact that the 
scattered signal is insensitive to the shape of the solute particles 
at the smallest Qs; only their overall size, in an rms sense, and the 
content of the sample matters. As with all Fourier data, information 
about structural detail is found at higher Qs. Even then, distinction 
can be difficult since differences are blurred out by the orientational 
averaging in eqn (5.7). 

The Q-dependence of the differential cross-section for a dilute so- 
lution of identical uniform spheres was given in eqn (5.11). The 
corresponding analytical derivation for particles of other shapes is 
not so easy, and cannot usually be stated in a closed form; formulae 


5.2 Size, shape and molecular form factors 


Table 5.1 Form factors for randomly oriented particles, of uni- 
form scattering-length density contrast 3; — 6o and volume Vp. 


Particle shape Differential cross-section 


(do/d®).. / (VÈ | G1 — Bol”) 


341(QR)]2 
Sphere of radius R l — | 
QL. an 
Thin rod of length L eee / SNY jr — ee 


Flat disc of radius R 


E 


for a few idealized cases are given in Table 5.1. When Q> R~1, the 
differential cross-section of eqn (5.11) falls off as Q74: 


N\ 1 2 R? 
(ia) ~ a Tr |1- Bol c0s*(QR). (5.26) 
el 


By comparison, the scattered intensity at high Qs decays as Q~? and 
Qt, respectively, for extremely thin discs and very long rods. The 
Q~* behaviour noted above for spheres is, in fact, representative of 
most objects (due to orientational averaging), as long as they are 
not vanishingly thin in any direction and have a smooth interface 
with a sharp SLD transition. This result was established theoreti- 
cally by Porod (1951,1952), and now bears his name. Averaging over 
the cosine-squared fringes in eqn (5.26), which gives 1/2, to mimic 
the effect of instrumental resolution at high Q, Porod’s law can be 


written as 
do NN 27Sp 2 
-= — — Bo f 2 
(aa), — GG) Bel 6.20 


for Q— oo, where Sp is the surface area of the particle (47R? for a 
sphere). 

If the interface between the solute and solvent or, more generally, 
the regions of the sample with SLDs of (3; and (, is diffuse, then the 
scattered intensity falls off much more rapidly, 


(5) 7 exp(—o?Q?) 
el 


dQ Q4 i 


where o is an rms measure of the boundary roughness. This is sim- 
ply a Debye—Waller effect, as discussed earlier in Sections 3.3.2 and 
4.1.1. 


123 
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fiear =Í 


5.2.4 Polydispersity 


So far, we have assumed that all the solute particles are identical 
in size and shape; this is known as monodispersity. If there is vari- 
ation, or polydispersity, but no correlation in the locations and ori- 
entations, the resultant differential cross-section is just a weighted 
average of the scattering from the individual constituents. Taking 
the case of spheres, for example, with a distribution of sizes defined 
by f(r), so that the fraction with radii between R; and Rə is 


Rə 


fio dr, 


Ri 


the differential cross-section is given by 


($) =- (5) slal) f f(r) iQ), ar. (5.28) 
0 


Q 
Equation (5.11) is recovered with the substitution of f(r) = 6(r—R), 
when all the particles have the same radius R. 
Polydispersity is usually investigated with a simple model for f(r). 
A fairly common choice is 


f(r) = Are" (5.29) 


which is known as a Schulz distribution. Its mean and variance are 
related to a and y through 


(ry =F ana ((r- Wy) 


_ are 
ya 


with the normalization constant being given by A = y!t°*/I'(1+a); 
three examples are shown in Fig. 5.6. Although the integral of eqn 
(5.28) can be evaluated analytically for a Schulz distribution, the 


r 


Fig. 5.6 The Schulz distribution: all three examples have the same mean, u, but 
standard deviations of 4/12 (solid), 4/3 (dotted) and 3/4 (dashed). 
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result is quite messy. In the vicinity of Q =0, however, the formula 
reduces to a quadratic form reminiscent of eqn (5.23), with R, now 
involving the ratio of the sixth and eighth moments of f(r), 


8 
pz = 3) (5.30) 


and the asymptotic value of the differential cross-section at the ori- 
gin being given by 


do 

dQ), 
As the spread of radial sizes decreases, so that f(r) — ô(r— R) and 
(r”) — R”, the formulae of eqns (5.30) and (5.31) revert to those of 
(5.24) and (5.18) respectively. When the polydispersity is substan- 


tial, the radius of gyration (and Vp) is weighted towards the larger 
particles. 


2 
= (5) = [A (r5) , (5.31) 
Q=0 


5.3 Assemblies and correlations 


The discussion of Section 5.2 was based on scattering from a dilute 
solution of non-interacting particles, or an equivalent system of SLD 
contrast variation. As the concentration rises, or the solute becomes 
more polarized, the assumptions of randomly located and oriented 
scattering centres underlying eqn (5.7) break down. 


5.3.1 Orientational alignment 


The simplest case of correlation is that of orientational alignment. If 
liquid samples are subjected to a shear stress (Hayter and Penfold, 
1984), or a magnetic or electric field is applied in suitable circum- 
stances, solute particles with non-spherical symmetry can be made 
to point in the same direction. The formula for the differential cross- 
section then differs from eqn (5.7) in its lack of averaging over ©: 


da ~ 
— 1Qer 43 
(a). x N | [ [9.0.2 dP)! (5.32) 
Və 


An idealized example of the scattered intensity from identical ellip- 
tical particles, of uniform SLD, with principal axes in the ratio 2:1:1 
aligned with the x, y and z axes respectively, is shown in Fig. 5.7(a); 
the incident beam is along z, so that Q, ~0, and the central portion 
of the detector is masked out. The corresponding randomly oriented 
situation is illustrated in Fig. 5.7(b). The advantage of alignment is 
clear to see: the non-spherical character of the solute particles is im- 
mediately obvious from the anisotropy in Fig. 5.7(a) but is lost in the 
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(a) (b) Qy 


Qa 


Fig. 5.7 The scattering intensity from a very dilute solution of identical ellipsoids, of 
uniform SLD, with principal axes in the ratio 2:1:1. The long axis is aligned with the 
x-direction in (a) and oriented randomly in (b), with the incident beam parallel to z. 


integration over ©. Indeed, the relative spread of the fringes par- 
allel to Q, and Q,, and the reciprocal relationship between length 
scales in real and Fourier space, tells us that the ratio of the long 
to short axis is 2:1 with the latter being along y. By contrast, the 
scattering pattern of Fig. 5.7(b) could easily be from a solution of 
polydispersed spheres. 


5.3.2 Positional correlation 


Whereas orientational alignment is advantageous for determining 
the shape of the solute particles, positional correlations make the 
analysis more difficult. To see this, let’s assume orientational (and 
size) invariance, so that 6.(r, ©) = 3.(r), and rewrite the SLD func- 
tion of eqn (5.3) in the form 


N 


B(r) = bo + Belr) @ >> d(r—ry) . (5.33) 


j=l 


With the aid of the convolution theorem of eqn (2.51), the Fourier 
transform of this G(r), and eqn (5.1), gives 


2 


do 


N 
l x | (2r)? 8(Q) + F(Q) 3 g aer (5.34) 


where 
= 6, 1Qer 3 . . 
F(Q) JIJ? (wje EdT (5.35) 


This is very similar to the expression earlier in the chapter, except 
that previously F also depended on ©, and had to remain inside 
the summation symbol. Ignoring the 6(Q) contribution when the 


modulus-squared is evaluated, because the central region close to 
the origin is not accessible experimentally, the differential cross- 
section reduces to 


(sa). x |F(Q)|’ {N+ 25 cvs [Q-(e)—1y)]} (5.36) 


j>j' 


for Q #0. If the locations of the particles are uncorrelated, then 
the contribution of the 747’ summation will be negligible compared 
to N due to the incoherent addition of the cosine terms; this was 
assumed in Section 5.2 whence, bar the orientational averaging, eqn 
(5.36) reduces to eqn (5.7). If there is a discernible pattern in the 
positions {r;}, however, the differential cross-section is no longer 
simply proportional to the modulus-squared of the molecular form 
factor, F(Q). In that case, 


do 


(sa). x |F(Q) |” S(Q) |, (5.37) 


where S(Q) corresponds to the term in curly brackets in eqn (5.36) 
and, since it depends on the positional correlations between the so- 
lute particles, is known as a structure factor. 

In small-angle scattering experiments, where the main interest 
is in the building blocks of the sample, an S(Q) other than a con- 
stant (N) is a complication that is best mitigated through dilution; 
sometimes, such as with highly polar macromolecules or porous ma- 
terials, it is just unavoidable. In the study of amorphous materials, 
which are entirely dependent on local interactions and positional 
correlations, the structure factor plays a central role. We will leave 
a discussion of S(Q) to Chapter 6, therefore, which is concerned 
with that topic. 


5.4 Pair-distribution function 


The basic scattering equation, (5.1), gives a highly non-linear rela- 
tionship between the SLD function, (r), and the differential cross- 
section for elastic scattering, (do /d 2). due to the modulus-squared. 
The difficulties resulting from this loss of Fourier phase information, 
in terms of ascertaining the sample structure from the scattering 
data easily and uniquely, were highlighted in Section 2.6. As indi- 
cated in Section 2.4.2, however, (da/d2)., does depend linearly on 
the ACF of 3(r): 


(5) e f f J ACF[A(r)] end, (5.38) 
- V 
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where 


ACF[A(r)] = f f i B(r’)* B(r’+r) dr’. (5.39) 
V 


Thus, scattering data really tell us about the correlations within the 
SLD function at different separations r. 

For randomly located solute particles, when the S(Q) in eqn (5.37) 
is a constant, the only coherent contribution to the ACF comes from 
the internal structure of the macromolecules. If the latter have no 
preferred orientation, then the ACF is spherically symmetric: 


g(r) Tor P< dmax, 


ACF[@(r)] = (5.40) 


0 otherwise , 


where r= |r| and dmax is the longest dimension within the solute 
particle (e.g. the major axis of an ellipsoid). Following eqn (5.12), 
therefore, eqn (5.38) reduces to the one-dimensional integral 


dmax 

do s sin(Qr) 

(S). x an fr g(r) oe dr. (5.41) 
0 


If the particles have a uniform SLD, 61, then the probability of find- 
ing two points within them separated by a distance of r to r+dr is 
given by P(r) dr where 


P(r) x 4mr?g(r). (5.42) 


The prefactor of 47r? comes from the integration of the ACF of eqn 
(5.40) over all directions for a given separation r. P(r) is known as 
a pair, or distance, distribution function, and is plotted for several 
idealized cases in Fig. 5.8. 


P(r) P(r) 


(a) (b) 


O 


Fig. 5.8 The pair-distribution functions for (a) a sphere (blue) and ellipsoids with 
principal axes in the ratio of 4:1:1 (dashed) and 1:4:4 (dotted), of uniform SLD, all 
having the same radius of gyration; (b) composite particles of two identical ellipsoids, 
with principal axes in the ratio of 2:1:1, joined (long) end-to-end (blue) and side-by- 
side (grey). The radii of gyration are indicated by arrows on the r-axes. 


Mathematically, P(r) can be obtained from the differential cross- 
section through an inverse Fourier transform: 


27 do\ . 
P(r) x = far (sq) sir) dQ. (5.43) 
0 


Such a direct inversion of the scattering data is hampered by their 
incomplete and noisy nature, and the complication that (da/d2),, is 
related to the intensity measurements through eqn (5.15) involving 
a resolution function and background signal; these difficulties were 
discussed in Section 2.6. Practicalities aside, the radius of gyration 
of eqn (5.22) can be ascertained from the pair-distribution function 
through its second moment: 


i 3 fr? P(r) ar. (5.44) 


The factor of a half comes from the fact that the r above refers to 
the distance between two arbitrary points within the solute parti- 
cle whereas in eqn (5.22) it denotes the distance of a single point 
from the centre of symmetry. To see this, let r} and rə be displace- 
ments from the natural origin and rı2 be the vector representing 
the separation between them. Taking the dot product of each side of 
rı2 = rə — rı with itself, which gives the cosine formula for a trian- 
gle, and averaging over all the different possible pairs of positions 
within the solute particle, 


(rio) =a (r?) + (r2) = 2(rir2cos6) , 


where @ is the angle between rı and r2. Equation (5.44) follows from 
the realization that (r?) = (rz) = R2, (rir2cos@) = 0 and the left- 
hand side is the second moment of P(r). 


5.5 Contrast matching 


For simplicity, we have generally assumed that the macromolecules 
of interest can be modelled as particles with a uniform SLD, (3). This 
is usually adequate since small-angle scattering experiments probe 
structure on length scales of several atomic diameters, or more, so 
that only an average SLD is seen. If the object consists of distinct 
subunits, however, then this approximation may be too poor to ex- 
plain the scattering data. A common case occurs when a globular 
molecule is enveloped by a layer of another, so that the structure is 
better described by a core-shell model: 


Bı for |r|<R,, 


B-(r,O) = (5.45) 
ae É Rı<|r|< R2, 


5.5 Contrast matching 129 


dm ax 


[risi 


o 


130 Small-angle scattering and the big picture 


Bo 


Bi 


(a) (b) (c) 


Fig. 5.9 A schematic illustration of contrast matching: (b) 6o = 61, and (c) Bo = Ga. 


and zero otherwise. Following eqns (5.4), (5.8) and (5.10), this leads 
to 
do An\ 
2A NIŻ 
(55). * ( Q ) 


The analysis of the data will be simplified, and our ability to distin- 
guish between the two components improved, therefore, if separate 
measurements can be made with the SLD of the solvent, 3,, chosen 
to equal 6ı and 62. Such a contrast match is easy to arrange for neu- 
trons, with a suitable mix of hydrogenated and deuterated versions 


(b1— bo) Rf j (QR) + (5.46) 


(a= 8o) [R3 j(QRə) = R2j(QR:)| 


H='H of the solvent. Since H2O and D20 have SLDs of — 0.56 x10~® i and 
+ 6.38 x107 A “, respectively, for example, values of 6, within this 
D ="H range are readily accessible for water-based solvents. The benefits 


of contrast matching are illustrated schematically for a core-shell 
structure in Fig. 5.9. 


Liquids and amorphous 
materials 


Liquids and amorphous materials are characterized structurally by 
their localized order; on length scales larger than a few nanometres, 
they exhibit no discernible pattern. 


6.1 The middle phase of maiter 


Ice, water and steam are chemically the same material, H2O (Fig. 
6.1), but they differ greatly in their physical properties. Below 0°C, 
and under normal atmospheric pressure, the water molecules are 
held in place by a regular and repeating network of hydrogen bonds; 
this solid crystalline ice state is defined by its long-range structural 
order (Fig. 6.2(a)). At the other extreme, above 100°C, is steam: 
the random gaseous phase consisting of well-separated molecules. 
Water occupies the middle ground. While there is enough thermal 
energy in the system for the hydrogen bonds to be broken easily, 
there is still an electrochemically driven tendency for neighbouring 
water molecules to orient themselves in certain directions with re- 


(a) (b) 


Fig. 6.2 (a) Normal ice, with a hexagonal crystalline structure. (b) A snap-shot of 
water, showing short-range molecular ordering but a generally disordered structure. 
(Courtesy of the MathMol Project, New York University.) 


e a 


Fig. 6.1 With two covalent bonds and 
two lone pairs per oxygen atom, water 
molecules are tetrahedral in shape. 
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spect to each other (Fig. 6.2(b)). This leads to some degree of molec- 
ular ordering over short length scales, but the correlations become 
weaker with distance to leave no discernible pattern over separa- 
tions of more than a few nanometres. Although these structural 
characteristics are typical of liquids, they can also be found in solids; 
the latter are known as amorphous materials, with glasses being one 
of the most common examples. 


6.2 Radial distribution functions 


Given their lack of a well-defined structure, liquids and amorphous 
materials are best visualized through their radial distribution func- 
tions. We met an example of the latter in Section 5.4, where P(r) dr 
was equal to the probability of finding two points separated by a 
distance of r to r+ dr within the macromolecular object of inter- 
est. The main difference with the small-angle scattering case is that 
the molecules comprising the sample are now small, and densely 
packed, and the measurements encompass much higher values of 
Q. As such, P(r) contains information on interatomic distances both 
within molecules and between them. 

To understand the interatomic link better, let us write the SLD 
function, (r), in the form of eqn (3.34): 


N 
Br) =) GN); (6.1) 
j=1 


where 6;(r) is the SLD distribution of atom j when centred at the 
origin (r; =0). The ACF of (r), which is related to the differential 
cross-section linearly through 


(33) n J J J ACE([A(r)] clr" de, (6.2) 
i V 


is then given by 


ACF|A(r)] = 1 Soa D T rr. 
vo j=l k=1 


When the summations are expanded and multiplied out, and the 
order of the integration and summation reversed, the ACF can be 
decomposed into a combination of two terms: 


ACF[6()] = g(t) + g2(r). (6.3) 


where 


N 
gi(r) = Blr’ —r;) Bi(r'—r;+ r) der’ (6.4) 
3 JIJ J J J J 


6.2 Radial distribution functions 133 


and 
go(r) = pD JJ B;(r’—r;)° Bx(r’—r_t+ 4) r. (6.5) 
GFK y 
The first, gı (r), is fairly uninteresting, as it results from the correla- 
tion of each atom with itself. It has a maximum at the origin, 


gı(0) = S J10 r, 


and decays rapidly to zero with increasing r; the fall-off occurs over 
an atomic diameter for X-rays and a much shorter distance, deter- 
mined by the rms thermal motions of the atoms, for neutrons. The 
second term, g2(r), depends on the spatial distribution of the atoms, 
and their types, and conveys information on the structure of the 
sample; let’s examine this further. 

For a sample consisting of N atoms, eqn (6.5) entails the sum- 
mation of N(N—1) functions of r. Each of these is the result of an 
integral that represents the correlation between atom j and atom k, 
with 74k, which will have a maximal value (in absolute terms) of 


S| [HEY auc) a 


when r=r;,—r, and then decay to zero (within an atomic diameter 
for X-rays, or a few times the rms thermal amplitude for neutrons). 
There will be an equivalent contribution when the suffices j and k 
are interchanged, because the same two atoms are involved, so that 
go(r) will be symmetric: 


go(—r) = g(r)”. (6.6) 


We have included the complex conjugates in the analysis for strict 
technical correctness, but they are unnecessary in our implicitly as- 
sumed case of real SLDs. While the relationship of eqn (6.6) holds for 
any type of sample, g2(r) has to be spherically symmetric for liquids 
and amorphous materials, 


go(r) = go(r) 


where r= |r|, as the constituent molecules can be found in all orien- 
tations; this symmetry also applies to g;(r). 

The discussion above indicates that gə(r) will contain a significant 
signal only at values of r that correspond to interatomic distances 
present in the sample. This will always be positive for X-rays, but 
can be negative for neutrons since the scattering lengths of the rel- 
evant pair of atoms can have opposite signs. The ‘peaks’ closest to 
the origin will be defined most clearly, and will stem from the fixed 
intra-molecular atomic distances and the nearest-neighbour inter- 
molecular ones. In water, for example, these would be the covalent 


R 
O 2 4 6 d 


Fig. 6.3 A simulation of the neutron 
pair-correlation function, g(r), for deut- 
erated water. (Soper, 2000). 
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O-H and hydrogen Oi'iH bond lengths, at r =1.0A andr ~ 1.9Å 
respectively, and the H.--H distance of r = 1.6A (Fig. 6.3). As r 
increases beyond a few nanometres, the lack of a long-range order 
will lead to a uniform continuum of interatomic distances. We can 
estimate the asymptotic value of go(r) by substituting the average 
value of the SLD, (8), into eqn (5.39): 


g2(r) — (b) V, 


where V is the ‘illuminated’ volume of the sample (assumed small 
compared to the total). 

To aid a comparison between different samples, it’s helpful to work 
with a normalized version of gə(r) which tends to unity for large r: 


d=, (6.7) 


This is called a pair-correlation, or a pair-distribution, function but 
is often simply referred to as g(r). Other functions that are closely 
related and commonly used include 


P(r) = 4rr?° g(r) and T(r) = 4rrg(r). (6.8) 


They all convey the same structural information, and really only 
differ in their asymptotic behaviour at large r: P(r) x r?°, T(r) xr 
and g(r)— 1. While only P(r) can strictly be called a radial distri- 
bution function, since it entails the integral of a three-dimensional 
function, g(r), over all directions for a given value of |r], 


2m T 
P(r) = fax fso 7? sin op deb, 
0 (0) 


which reduces to 477? g(r) as g(r) is independent of the spherical 
polar angles x and w, T(r) and g(r) can also be considered as such 
because they too depend only on |r|. The three functions, g(r), T(r) 
and P(r), are illustrated in Fig. 6.4. 


6.3 Structure factors 


The linear relationship between the differential cross-section and 
the ACF of the SLD function enshrined in eqn (6.2) means that the 
decomposition of the latter into g;(r) and go(r) in eqn (6.3) must be 
mirrored in the scattering data. To see this, let’s start by taking the 
Fourier transform of (3(r) in eqn (6.1): 


[|| Ar) & Or dêr = EC) Haney, a 
V j= 


Å 
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Fig. 6.4 The radial distribution functions g(r), T(r) and P(r), as defined in eqn (6.8), for germania glass (vitreous GeO2); the 
expected asymptotic behaviour at large r is indicated by the dashed grey lines. (Hannon et al., 1990) 


where atom j is located at r; and has a form factor f;(Q), 


f(Q) = Jija% ed i (6.10) 


Form factors were previously discussed in Sections 3.2 and 3.3, and 
denoted by f;(\,6). Since the differential cross-section is given by 
the modulus-squared of the Fourier transform of 6(r), as in eqn (5.1), 
the product of the sum on the right-hand side of eqn (6.9) with its 
complex conjugate leads to 


do 
(a). oe Sı (Q) + S2(Q) ; (6.11) 
where P 
S:(Q) = X JEQQ (6.12) 
j=1 
and l 
S2(Q) = 2 Y Re SE (Q) f(Q)" A ) (6.13) 


j>k 
The first term depends purely on the scattering properties ofindivid- 


ual atoms, and not on their locations; as such, Sı (Q) is the Fourier 
counterpart of g(r), 


SQ) = fff ei) do" or, 
V 


and is of little intrinsic interest. Structural information about the 
sample is contained in S2(Q), since it depends on the relative posi- 
tions of different pairs of atoms, which makes it the Fourier coun- 


terpart of gə(r): 
S2(Q) = MES paer aay. (6.14) 
V 
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The disordered nature of liquids and amorphous materials means 
that, just like g)(r) and go(r), S;(Q) and S2(Q) are spherically sym- 
metric; that is, they depend only on |Q]. 

Equations (6.2) and (6.11) were expressed as proportionalities to 
allow for the various conventions used in the normalization of the 
differential cross-section. For the most common choice of a division 
by the number of atoms, N, 


R 
dry 1 = 2 An(B)?V 5 sin(Qr) 
(sa) =- w 2, 182) a EE. Boe O° 


where eqns (5.12) and (6.7) have been used to rewrite the Fourier in- 
tegral of eqn (6.14) in its one-dimensional form and R= (3V/ 47r)" R 
The first term on the right-hand side is simply the average value of 
the modulus-squared of the form factor, ( | f (Q)? ), and the second 
is better evaluated by calculating the Fourier transform of g(r) —1 
instead of g(r): 


R R ; 
4 0 


where the discrepancy A(Q), being the Fourier transform of a con- 
stant (unity), is peaked sharply around the origin (Q < 27/R). Since 
the region close to Q =0 is not accessible experimentally, as it coin- 
cides with the straight-through unscattered beam, and because the 
upper limit of the integral on the left-hand side can be replaced with 
infinity as g(r)—1— 0 for large r, 


do 2 
al = (£(Q)| ) S(Q), (6.15) 
where the structure factor, S(Q), is given by 
SiOy= i+ SO frlet) sin(Qr) dr | , (6.16) 
0 


for Q40, where 
(BYV 


TEON . (6.17) 


a(Q) = 


The expression for a is independent of Q for neutrons, as f;(Q) is 
simply equal to the (coherent) scattering length of the jth nucleus, 
(b;). With (3)= n (b), where n = N/V is the atomic number density, 
the formula for a reduces to just 


a(Q) =n 
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for neutron scattering from a monatomic sample. 
Mathematically, eqn (6.16) can be rearranged to give g(r) in terms 


f S(Q): 
of S(Q) S 
i “Fp. eO- 
g(r) = ] + TEF Je aQ) sin(Qr) dQ. 1 
0 
As a means of obtaining the pair-correlation functions from the scat- 
tering data, however, such a direct inversion is fraught with many 
practical difficulties; these were discussed in Section 2.6. While the j T Q(a7') 


lack of phase information in diffraction data is not an issue for g(r), 
the limited extent of the measurements in Q, their noisy nature, the Fig. 6.5 The neutron scattering data 
blurring from the instrumental resolution, the (incoherent) back- related to the pair-correlation functions 
ground signal, and other experimental considerations, all need to be °° Fig. 6.4. (Hannon et al., 1990) 
taken into account. 


6.3.1 Partial structure factors 


The pair-correlation function is easiest to interpret for a monatomic 
sample, as all the ‘peaks’ in g(r) correspond to distances between 
atoms of the same type. The situation becomes moderately compli- 
cated even for the diatomic case, such as water or a binary alloy, 
since g(r) is then a conglomeration of three sets of separations: AA, 
AB and BB, where the two elements have generically been labelled 
as A and B. There are six pairwise combinations for a triatomic sam- 
ple, and so on. 

For a material consisting of M different types of atoms, g(r) can 
be decomposed into a linear combination of M(M +1)/2 partial pair- 
correlation functions, g;g(r): 


M M 
g(r) = >> SO gagat), (6.18) 


where the coefficients, y;x, are fixed by the requirements of consis- 
tency with eqns (6.5) and (6.7). With g;x(r)— 1 for large r, g(r) can 
only do likewise if the coefficients sum to unity: 


M M 
DoD ee = 1. (6.19) 


J=1 K=J 


In conjunction with the observation that 


Vow X (2—0d)x) Cj CK (b;) (bx) ; (6.20) 
where the Kronecker delta, ôx, equals 1 if J= K and 0 otherwise, M 
and c; is the fraction of atoms of type J in the sample, the constant of D= >, cz (bz) 
proportionality is found to be the inverse of the square of the average J=1 


scattering length, (b) 
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Fig. 6.6 The partial pair-correlation functions, gou(r), guu(r) and goo(r), corresponding to the structural simulation of water 
in Fig. 6.3. (Soper, 2000) 


The decomposition of eqn (6.18) is mirrored in Q-space, with S(Q) 
being a linear combination of M(M+1)/2 partial structure factors, 


Six(Q): - Ee 
= ` bD Vax SJK (6.21) 


J=1 K=J 


This equation, and the link between the partial structure and pair- 
correlation functions, 


CO 


onl): = 14 a) fr Elt 1] sin(Qr) dr (6.22) 
0 


is easily established by substituting for g(r) from eqn (6.18), and for 
unity from eqn (6.19), into eqn (6.16). 

The analysis above tells us that information on the distribution 
of distances between atoms of type J and K, enshrined in g(r), 
is contained unambiguously in the corresponding partial structure 
factor, S;.(Q). To see how the latter could be ascertained from mea- 
surements of the differential cross-section, let’s explicitly consider 
the case of a binary sample. With atoms of only type A and B, eqn 
(6.21) becomes 


eR (ba)” Saa(Q) + 2 (bs)” Spe (Q) + 2cace (ba) (bp) Sap(Q) 
[ca (ba) +p (bp) |” 


so that the scattering data, S(Q), are a (known) linear combination 
of the three partial structure factors S44(Q), Sge(Q) and S4p(Q). 
If it were possible to alter the scattering lengths of the atoms, (ba) 

r (bp), or both, while keeping everything else the same, then we 
would obtain scattering data pertaining to a different weighted sum 
of the desired partial structure factors. In the absence of experimen- 
tal limitations, three such sets of measurements would be sufficient 
to determine S44(Q), Sge(Q) and S4n(Q) uniquely since, for a given 
Q, it would amount to solving three linear simultaneous equations 
for three unknowns. A generalization of this argument leads to the 


S(Q) = 


7 
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conclusion that the partial structure factors of an M-element sam- 
ple could be disentangled in this way by combining the data from 
M(M-+1)/2 suitable differential cross-sections. The nuclear depen- 
dence of neutron scattering lengths makes this a viable strategy if 
sufficient well-calibrated isotopic variations of the sample can be 
prepared. 


6.4 Comparison with small-angle scattering 


We began our discussion of liquids and amorphous materials by not- 
ing that they represent a state half way between perfect order and 
complete disorder. Samples studied with small-angle scattering, by 
contrast, are ideally considered to have a gaseous structure. This 
distinction lies at the heart of the apparent difference between the 
associated analyses. 

The common theoretical origin of the material in Chapters 5 and 
6 is underlined by the similarity of eqns (5.37) and (6.15), where the 
differential cross-section is expressed as the product of a form factor 
term and a structure factor. The former is defined by the nature of 
the basic constituents of the sample, whereas the latter depends on 
their relative locations. The analysis of small-angle scattering ex- 
periments is simplified through the ‘dilute solution’ approximation 
whereby the macromolecules, or density fluctuations, of interest are 
spread so far apart as to be non-interacting. This is, in essence, the 
ideal gas model. Without positional correlations, the structure fac- 
tor is featureless and the differential cross-section depends only on 
the size and shape of the constituent particles. The assumption of a 
dilute solution becomes poorer as the concentration rises, and even- 
tually the approximation of a uniform structure factor is no longer 
adequate. Both S(Q) and the form factor then play an important 
role in our structural understanding of the sample from small-angle 
scattering measurements. 

In the study of liquids and amorphous materials, the constituent 
particles are composed of only a few atoms, sometimes just one, and 
they are not separated by an intervening solvent or air-like matrix. 
The form factor term is of little interest, therefore, as it simply tells 
us an average property of the atoms in the sample. For neutrons, it 
merely constitutes a multiplicative constant as far as the differen- 
tial cross-section is concerned; for X-rays, it’s a modulation that de- 
cays slowly with Q. The structural information is contained entirely 
in S(Q) which, in turn, is a reflection of the local atomic interactions 
in the system. 


6.5 The Placzek correction 


For simplicity, the analysis in this chapter, as in the preceding three, 
has been based on the assumption of elastic scattering. That is to 
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say, it hinged upon the Fourier relationship between the differential 
cross-section and the SLD function derived in Chapter 3: 


(5), Il plr) do de i (6.23) 


where the wavevector transfer is given by eqn (3.8). As noted in 
Section 3.1.3, diffraction experiments generally don’t involve energy 
discrimination and rely on the property that the scattering tends to 


be predominantly elastic: 

do do do do 

dQ 7 (sa), (ia), (55). l i 
While inelastic events can be ignored to a very good approximation 
in solid samples, this is not the case for liquids since their atomic 
structure cannot reasonably be regarded as being fixed. A proper 
discussion of the issue will have to wait until Chapter 8, when the 
basic ideas of inelastic scattering are introduced, but we state here 
that eqn (6.23) holds for the total differential cross-section as long 
as the energy transferred is small compared to the incident kinetic 
energy of the X-ray or neutron (so that |k;| ~ |ky|); this is known as 
the static approximation. Under this condition of ‘virtual elasticity’, 
da/dQ is related to the Fourier transform of an instantaneous view 
of the ever changing detailed structure. 

Following the work of Placzek (1952), a correction is usually ap- 
plied to scattering data from liquids to account for deviations from 
the static approximation. The related multiplicative factor is gen- 
erally small and becomes increasingly negligible for heavier atoms 
and higher incident energies. 


Periodicity, symmetry and 
crystallography 


Our final look at elastic scattering concerns the study of the most 
highly ordered phase of matter: the crystalline state. This gives rise 
to diffraction patterns with very sharp and clearly defined structure. 
For this reason, it is the field in which the earliest (X-ray) scattering 
experiments were conducted. 


7.1 Repetitive structures and Bragg peaks 


The central property of crystalline materials, and the feature that 
endows them with long-range order, is their inherently repetitive 
nature (Fig. 7.1). This underlying periodicity in the structure can be 
stated mathematically as 


P(r) = blr + na + nb + ngc) (7.1) 


for any set of integers nı, n2 and n3. That’s to say, the SLD at a given 
location in the sample, r, is the same as that at a point translated 
by integer multiples of the vectors a, b and c with respect to it. The 
parallelepiped formed by a, b and c defines the unit cell, or basic 
building block, of the crystal: identical copies are stacked in a three- 
dimensional lattice to create the periodic structure. The lengths of 
the vectors (a, b, c) and the angles between them (a, 8, y) are known 
as lattice constants. 


Fig. 7.1 A two-dimensional illustration of the fundamental periodicity in crystals: 
the stars mark equivalent points, related by translational symmetry. 
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glam =j 


Veell = |ae(bxc)| 


As always, the principal equation linking the SLD function to the 
differential cross-section for elastic scattering is 


(iA), x | [ff Pear i (7.2) 
V 


where V is the volume of the sample illuminated by the beam of X- 
rays or neutrons. For a G(r) with the translational symmetry in eqn 
(7.1), the Fourier transform is related to an integral over the volume 
of a single unit cell, Vse: 


JII B(r) $ dr = Lp(Q) ite ciQer 43, (7.3) 
: 


Veell 
where 


Lr(Q) = apes exp[iQ.(nja+ nob+4+ nzc)] . (7.4) 


n2 na 


The summation of the imaginary exponential over n1, n2 and ns will 
cancel out to zero unless the terms add up coherently. For the latter 
to happen, Q must satisfy the condition 


Q.(matnob+ngc) = o + 2T, (7.5) 
where ¢, is a constant and n is an integer. This is achieved when 
Q=hA+kB+1C (7.6) 


with h, k and / being integers and 


(7.7) 


Using the properties of scalar triple products, the vectors A, B and 
C can be shown to obey 


QcA = b.B=c.C=2r and 


7.8 
aeB=aeC=beA=b.C=c.A=c.B= 0, l ) 


so that n = nıh + nak +nsl and ¢,= 0 in eqn (7.5). Hence, Lr (Q) 
will have a value V/V.., the number of illuminated unit cells, if Q 
satisfies eqn (7.6) but will be vanishingly small otherwise. As such, 
the non-zero points of Lr (Q) define a regular three-dimensional grid 
in Q-space, 


Lr(Q) x XXO Y 6(Q-(hA+kKB+IC)), (7.9) 
h k l 


which is known as the reciprocal lattice; the vectors A, B and C are 
called reciprocal vectors. 
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Fig. 7.2 (a) A two-dimensional crystal lattice, L(r), defined by the unit cell vectors a 
and b. (b) The corresponding reciprocal lattice, Lp (Q), with reciprocal vectors A and 
B: a is perpendicular to B, b is orthogonal to A and their lengths satisfy the inverse 
relationship A/B = b/a. 


The conclusions of the preceding analysis, that the scattering from 
a crystalline sample is non-zero only at sharp well-defined points in 
Q, can also be arrived at with the convolution theorem of Section 
2.4.1. To see this, we simply need to note that the repetitive struc- 
ture encapsulated by eqn (7.1) can be expressed as a convolution of a 
single unit cell and a three-dimensional array of 5-functions defining 
the crystal lattice, L(r): 


B(r) = Lr) ® Been(r) (7.10) 


where 


L(r) = ae? ô(r — (ma+neb+ngc)). (7.11) 


n2 n3 


According to the convolution theorem, therefore, the Fourier trans- 
form of @(r) will be equal to the product of the Fourier transforms 
of L(r) and the SLD of a unit cell. This is exactly eqn (7.3), with the 
identification that 


LR(Q) = | [fw gar g3y, (7.12) 
V 


While the quantitative result of eqns (7.7) and (7.9) follows from the 
earlier argument, it can be understood qualitatively as being a gen- 
eralization of the diffraction pattern from a grating given in Section 
2.5.3: it’s just a three-dimensional array of 6-functions with ‘recip- 
rocal spacing’. Although the Fourier transform of (..1;(r) is defined 
for all Q, the discrete nature of La(Q) with which it is multiplied 
means that the scattering from an ideal crystalline sample can be 
non-zero only for very specific values of Q. These isolated points of 
scattered intensity are known as Bragg peaks. 
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7.1.1 Atomic planes and Bragg’s law 


The discrete and ordered nature of the scattering from crystals can 
be thought about in another way which provides a more direct link 
with the atomic structure. It relies on the basic reciprocity of Fourier 
transforms, illustrated in Fig. 7.3, whereby a functional width of w 
in real-space maps to a spread of 27/w in the corresponding direc- 
tion of diffraction-space, and repetitive features with a spacing of d 
in r appear with a periodicity of 27/d in Q. The pattern of Bragg 
peaks can then be understood as the signature of diffraction from 
(vast) parallel arrays of atomic planes, with different spacing and 
orientations. If the unit vector normal to the planes is n and they 
are a distance d apart, so that 


ren = Nd+A (7.13) 


where A is an offset and N=0,+1,+2,+3,..., they will give rise to 
a line of diffraction peaks at 
2 
Q =G. (7.14) 
d 
When the modulus of this wavevector transfer is equated with the 
expression in eqn (3.7), 
4r sin 0 27N 


goa - =, 


it leads to the most well-known result in scattering theory: 


NA = 2dsiné |, (7.15) 


or Bragg’s law. 


(a) Y (b) Qy 
A A 


Fig. 7.3 (a) The aperture function for a two-dimensional grating of line spacing d 
and longitudinal width w. (b) The corresponding diffraction pattern. 
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An elementary derivation of Bragg’s law 


The idea that X-rays might be diffracted by a crystal lattice was first pro- 
posed by Max von Laue (1912); it was subsequently observed by Friedrich 
and Knipping (1913), and confirmed the wave nature of X-rays. Lawrence 
Bragg (1912) extended this work by providing an elementary argument 
that helped with the understanding of the scattering pattern and, along 
with his father Henry Bragg (1913), went on to build an X-ray spectrome- 
ter and solve some simple crystal structures. 

Bragg imagined that, theoretically, the situation would be similar to 
the case of specular reflection from an array of parallel planes. Following 
the geometrical setup in Fig. 7.4, with an interplanar spacing of d and a 
scattering angle of 20, the difference in the length of the path travelled by 
waves reflected from adjacent planes, AJL, is 


AL = 2asm?é. 


Constructive interference between all the outgoing waves will only occur, 
however, when this path difference is a whole number of wavelengths, A: 


i = NY), 


where N is an integer. The combination of these equations leads to the 
Bragg condition for a scattering signal to be detected: 


NA = Dal sine. 


Although eqn (7.15) provides a simple relationship between the 
interplanar spacing and the resultant scattering angles for X-rays, 
or neutrons, of a given wavelength, it loses the vectorial content of 
eqn (7.14). Specifically, that the direction of Q is perpendicular to 
the atomic planes responsible for the scattered signal. The wavevec- 
tor transfer can always be defined by two scattering angles (9 and ¢) 
and à, as in Fig. 3.3 and eqn (3.8), but in the crystallographic case it 
is also specified by the three integers in eqn (7.6). Hence, the Bragg 
peak indexed by h, k and l (with 6=6,,;,) corresponds to planes with 
a normal direction, naki, given by 


x À 
finki = ——— (hA+kB+IC), (7.16) 
At sin hkl 


where the scalar prefactor ensures unit length. This representation 
of the orientation of the planes, which is expressed in terms of the 
reciprocal vectors, can be transformed into a lattice-based picture by 
substituting it into eqn (7.13), the equation of the planes, with the 
general point r written as 


r=wvwativb+we. (7.17) 


Fig. 7.4 Bragg’s setup for reflections 
from uniformly spaced layers. 
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In conjunction with the orthogonality relations in eqn (7.8), it leads 
to the following condition on u, v and w for r to be on a plane per- 
pendicular to f pkı: 

À 


——— (hu+kv+lw) = Ndr +A. 
2 sin Ôhki 


Thus the orientation of the atomic planes is such that they cut the 
a-axis (where v = w = 0), the b-axis (with u = w = 0) and the c-axis 
(when u=v=0)at 


1 l (7.18) 


respectively, where ņ is a constant. 


7.1.2 Simple consequences and applications 


Before moving on, it is worth noting some elementary consequences 
and applications of Bragg’s law. For example, the physical bound of 
180° on scattering angle 20 can be combined with eqn (7.15) to yield 


NA 
3d 
With the integer N denoting the order of the reflections with respect 
to the straight-through beam (which is coincident with N = 0), this 
shows that no Bragg peaks will be observed if \ > 2d. Too short a 
wavelength, when \<d, might also be a problem as the large num- 
ber of allowed reflections become difficult to distinguish as separate 
entities. 

Although the determination of enough interplanar d-spacings can 
pin down the atomic positions in very simple materials, most sam- 
ples require a much more detailed analysis of the diffraction pattern 
before their structure can be inferred. There is an engineering ap- 
plication, however, that relies primarily on the use of Bragg’s law: 
the measurement of residual stress. The processes involved in the 
manufacture of mechanical components, such as railway tracks and 
turbine blades, often lock-in latent stresses that can contribute to 
their premature failure. These can be probed by examining the re- 
sulting distortions incurred by the crystal lattice. If a tensile stress 
o causes the interplanar spacing to increase from a value of d, in an 
annealed sample to d in the worked material, for example, then its 
magnitude can be estimated from the resultant strain, (d—d,)/do, 
through a multiplication with the corresponding Young’s modulus of 


elasticity, F: 
d B do 
= E. 


The most common ‘direct use of Bragg’s law is not for d-spacing 
determination but wavelength selection. When a polychromatic beam 


sind = <1. 


of X-rays or neutrons impinges on a crystal of known structure and 
orientation, only those with specifically chosen wavelengths emerge 
at a given scattering angle of 20: 
2d sin 6 
i= wT 

While an ideal monochromator would select a single à, the integer 
N in Bragg’s law means that particles with a half, a third, a quar- 
ter, and so on, of the principal wavelength could also pass through. 
Most of these are unlikely to be present in the polychromatic source 
on energetic grounds but, if better monochromation is required, the 
emergent beam can be passed through a second crystal with a dif- 
ferent d-spacing. 


7.2 Patterns and symmetries 


As mentioned earlier, the central feature of crystalline materials is 
their inherently repetitive nature. While they all share the trans- 
lational symmetry of eqn (7.1), which gives rise to the reciprocal 
lattice, many display patterns within the unit cell. These additional 
symmetries in the SLD, in turn, lead to patterns in the intensities of 
the Bragg peaks. 


7.2.1 Reality and Friedel pairs 


A general property of G(r) that we have assumed throughout this 
book, for simplicity, is the reality of the SLD: 


By ape. (7.19) 


As noted in eqn (2.47), its Fourier transform, 


F(Q) = f | J B(r) $ dr, (7.20) 
V 


is conjugate symmetric: 
F(—Q) = F(Q)*. (7.21) 


For crystalline samples, F(Q) can have non-zero values only when 
eqn (7.6) is satisfied. As such, eqn (7.21) becomes 


where h = —h, k = —k and l = —l. Since the measured signal scales 
with IF|? = FF", the intensity of the Bragg peak indexed by hkl 
will be the same (subject to experimental corrections) as that for 
the reflection denoted by hkl; these are known as Friedel pairs. A 
violation of Friedel’s law is indicative of a breakdown of eqn (7.19), 


and is called anomalous scattering. 
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7.2.2 Centrosymmetry and reality 


With a suitably chosen origin, which is arbitrary in the absence of a 
Fourier phase, the SLD function may be symmetric with respect to 
an inversion through r=0: 


B(-r) = G(r). (7.23) 


Structures having this property are said to be centrosymmetric, and 
their Fourier transforms satisfy 


F(—Q) = FQ): (7.24) 


Assuming that the wavelength of the X-rays or neutrons is such that 
the crystalline sample contains no sources of anomalous scattering, 
eqn (7.24) can be combined with eqn (7.21) to yield 


The structure factors of centrosymmetric crystals can be represented 
by real numbers, therefore, as their phases can be restricted to take 
values of only 0 or z radians. 


7.2.3 Space groups and systematic absences 


Just as the translational symmetry of eqn (7.1) enables the SLD of 
the entire crystalline sample to be deduced from a knowledge of the 
contents of a single unit cell, so too internal symmetries enable the 
SLD of the complete unit cell to be deduced from knowledge of a 
certain fraction of it known as the asymmetric unit. For the simple 
case of centrosymmetry, only one half of the unit cell is required: 
O<u< , O<v<el, O<w<l, 

for example, where u, v and w are fractional unit cell coordinates, as 
in eqn (7.17), so that lattice translation reduces to 


B(u,v,w) = B(utn,vtne,wtns), (7.26) 


where ni, n2 and ng are integers. The SLD in the rest of the unit cell, 
with 1/2 <u <S 1, is given by 


B(u,v,w) = B(1—-u,1—v,1-w), 


where the locations parameterized by (u,v, w) and (1—u, 1—v, 1—w) 
are said to be equivalent positions. 

The number and type of equivalent positions define the symmetry 
of the crystal within its unit cell. There are 230 alternatives listed 
in Volume A of the International Tables for Crystallography, called 
space groups, and each leads to a corresponding set of relationships 
for its structure factors. Before illustrating this with an example, let 


us rewrite the Fourier transform of eqn (7.20) in a way that is better 
suited to crystalline samples; that is, in terms of Miller indices, hkl, 
and fractional unit cell coordinates. With Q and r given by eqns (7.6) 
and (7.17), and the orthogonality relations of eqn (7.8), 


Qer = 2n(hutkv+lw) and d°r = Vee dududw, 


so that eqn (7.20) becomes 


, (7.27) 


following eqns (7.3) and (7.9). 

Let us consider one of the most common space groups, P 2;/c, or 
number 14 in the International Tables, which has four equivalent 
positions: 


(u,v, w), (l-u,$+v,$-w) ; 


(7.28) 
(l—u,1l—v,1-w), (u,5—v,$+w) : 


with an asymmetric domain of 
O<u<l, O<v<4q, O<w<il. 


According to the fundamental translational symmetry of eqn (7.26), 
any fractional coordinate that goes outside the range of zero to one 
on applying the equivalences of eqn (7.28) can be reassigned a loca- 
tion within the [0,1] unit cell through the addition (or subtraction) 
of an appropriate integer. The Fourier transform of eqn (7.27) can 
then be evaluated through an integral over the asymmetric unit 


11/41 


Frei -vf [Pluv.w) ¢ &(h,k,l,u,v,w) dudvdw , (7.29) 
0 v=0 0 


where the function £ is a sum of four complex exponentials, e!@°", 
over the equivalent positions in eqn (7.28). After some trigonometric 
algebra, it is found that 


E = 4cos|Q2rku +2 ~(k+1)| cos|2m (hu +1w) z 5 (E+) (7.30) 


The fact that € is real, rather than complex, is a reflection of the 
centrosymmetry of the space group: the two equivalent positions in 
the second line of eqn (7.28) are related to those of the first by an in- 
version through the origin (and lattice translation). The other thing 
to note is that €, and hence F,gı, will be equal to zero if either of the 
cosines in eqn (7.30) is zero. As this will occur when k = 0 and / is 
odd, or when h=/=0 and k is odd, there will be a systematic absence 
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For =0 forl = 2n 
Foro =O fork 4 2n 


of certain Bragg peaks in the diffraction pattern. These are part of 
the fingerprint, in reciprocal space, of the symmetry within the unit 
cell. A more careful examination of the properties of eqn (7.30) leads 
to the conclusion that the structure factors come in related groups 
of four: 


Mia = Bye SS 1) a Pa (7.31) 


The systematic absences, or extinctions, then follow as special cases. 
When k=0, for example, 


Fror = ai Furor => Fiore =O for [= 2n+1, 


where n is an integer. The theoretical equivalence of the intensity 
of groups of reflections, such as the quadruples in eqn (7.31), known 
as the Laue symmetry, is used to calibrate experimental corrections 
such as absorption. 


7.2.4 Geometry and space groups 


Although there is no mathematical constraint on the allowed SLD 
pattern within a unit cell, physical considerations, such as the quan- 
tum mechanical rules for atomic and molecular bonding, mean that 
only certain types of symmetry can occur in practice; this limits the 
number of potential space groups to 230. Indeed, a knowledge of the 
chemical content of a unit cell can restrict the possible space groups 
further. If the sample is chiral, for example, as might be expected for 
a biological compound, and enantiomerically pure, then the internal 
symmetry of the unit cell can only be consistent with the 65 space 
groups that preserve handedness. 

As listed in the International Tables, each of the space groups is 
assigned a crystal system: triclinic, monoclinic, orthorhombic, tetrag- 
onal, trigonal, hexagonal and cubic. While these relate to the seven 


Table 7.1 The seven possible geometrical shapes of a unit cell, as defined by their 
lattice parameters, and the space groups that share their characteristic symme- 
tries (numbered in accordance with the International Tables of Crystallography). 


Crystal system Unit cell geometry Space group numbers 
triclinic a-bFec actly 1-2 
monoclinic afte aSj7= 00" 48 3-15 
orthorhombic a#zbséc; a=B=y=90° 16-74 
tetragonal a=b c=f=7=90" 75-142 
trigonal e=b=c; c=6S=7F7 900" 143—167 
hexagonal a=b#c; a=68=90°,y=120° 168—194 


cubic a=b=c; a= ß8=y=90° 195-230 
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alternative shapes that a unit cell can have, depending on the re- 
lationships between the lengths of its sides and the angles between 
them, the association is one of shared symmetries rather than ac- 
tual geometry. There is no formal requirement for a link between 
the internal SLD symmetry of a unit cell and its external morphol- 
ogy, since space groups can be specified in terms of equivalent posi- 
tions that make no reference to lattice geometry, but they are found 
to be correlated strongly in practice. As such, the shape of the unit 
cell inferred from the reciprocal lattice provides the first indications 
about the probable space group of the crystal. 


7.2.5 Symmetry and statistics 


For reasons of atomic coordination, efficient molecular packing and 
energetic stability, crystallization does not occur with the same fre- 
quency in each of the 230 space groups. About a third of all small 
organic molecules conform to P 2;/c, for example, whereas proteins 
are found most often in the P 2; 2,2; and P2, configurations. Such 
statistical information can be helpful in the assessment of the space 
group, when the incomplete and noisy nature of experimental data 
leaves room for doubt on the basis of the Laue symmetry and sys- 
tematic absences of the diffraction pattern. 

An analysis of the distribution of Bragg peak intensities can also 
give an indication as to whether a crystal has adopted a centrosym- 
metric space group or a noncentrosymmetric one: the former tends 
to result in a higher proportion of reflections with very low intensity, 
I, relative to the average, u, as compared to the latter. This was first 
noted by Wilson (1949), and the theoretical curves bearing his name 
that capture this tendency are plotted in Fig. 7.5. An alternative 
to the conventional explanation, which postulates randomly located 
atoms of comparable scattering length, is given in the appendix of 
Sivia and David (1994). There the two probability distributions are 
seen to represent our state of knowledge about the intensity of a re- 
flection when given only the value of the average and the possibility 
of inversion symmetry. 


7.3 Circumventing the phase problem 


The biggest theoretical obstacle to ascertaining the structure of the 
crystal, 6.eu(r), from the diffraction data, {I;,,;}, is the lack of the 
Fourier phase: 


Ingi x |Fnkil , (7.32) 


where the SLD in the unit cell is related linearly to F),,; through 
eqn (7.27). The difficulties associated with the phase problem were 
highlighted in Section 2.6. Circumventing them hinges, ultimately, 
on our ability to supplement the missing information with suitable 


prob(/ |u) 


Fig. 7.5 The centric (dotted) and acen- 
tric (solid) Wilson distributions; (J) = pu. 
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prior knowledge. This entails the use of chemical and physical knowl- 
edge about the sample to varying degrees but, in some circumstances, 
might also involve the collection of several sets of carefully chosen 
diffraction measurements. 


7.3.1 Patterson maps 


The easiest way of getting around the phase problem is to avoid it! 
Rather than trying to work with the SLD function, just consider its 
auto-correlation: 


I. 1 


Inki X J | facelse.e.w)] el2m(hutkut+lw) dy dudw, (7.33) 
000 


where 
iii 
ACF|A( (u,v, w)| =| f fo x,y,z) B(at+u,ytv,zt+w) drdydz. 
00 0 


We discussed the ACF in a general one-dimensional context in Sec- 
tion 2.4.2, and in relation to the phase problem in reflectivity data 
in Section 4.1.4; we also met it in the guise of pair-correlation and 
pair-distribution functions in Sections 5.4 and 6.2. The main differ- 
ence in the crystallographic case, where it is called a Patterson map, 
is that it’s genuinely three-dimensional and often contains a high 
degree of symmetry. While the details of the latter depend on the 
space group of ((r), it always has the translational symmetry of the 
crystal lattice: 


ACF(u,v,w) = ACF (u+n1, v +n2, w +n3), 


where n1, n2 and ng are integers. As two (point-like) atoms located 
at rı and ro, and of scattering lengths b; and bz (assumed real), con- 
tribute to the ACF an equal signal, b1b2, at rı — rs and ro—rj, 


ACF|bıô(r—rı)+ b25(r—ra)| = (b? +03) (r) + b1b25(r +(r1—r2)) , 


Patterson maps are centrosymmetric. 

The linearity of eqn (7.33) means that there is a one-to-one map- 
ping between the diffraction pattern and the ACF of ((r). As such, 
the loss of phase of Fakı will be reflected in the Patterson map 
through its ambiguities with regard to the crystal structure. This is 
apparent from a simple consideration of their allowed symmetries: 
Patterson maps can adopt only 24 of the 230 space groups available 
to the SLD, implying that the relationship between them is not one- 
to-one. Despite this shortcoming, the real-space representation of 
the diffraction data provided by the ACF can be helpful in unravel- 
ling the crystal structure. 
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7.3.2 Heavy atoms and partial structure 


Although it’s difficult to ascertain G(r) from ACF | 3(r)| in general, 
the task of inferring the location and orientation of a certain frag- 
ment of the structure tends to be more accessible. If the sample is 
known to contain a heavy atom, for example, then the most intense 
peaks in the Patterson map from X-ray data are likely to be related 
to it; their analysis will often indicate its position. In fact, stoichiom- 
etry and space group symmetry alone can be enough to determine 
the location of some atoms. While the heavy atom scenario is rare 
in neutron work, as the strength of the nuclear interaction varies 
more uniformly across the periodic table, the presence of a negative 
scattering length among otherwise positive ones yields a Patterson 
map that is highly informative: the resultant negative peaks in the 
ACF will all be due to correlations between this reference atom and 
the rest of the structure. For macromolecules, such as proteins, the 
building blocks (amino acids) are usually well known. Their orien- 
tation may then be explored by looking for corresponding patterns 
of peaks in the Patterson map. 

Suppose now that part of the crystal structure is ‘known’. Denot- 
ing this generically as the heavy-atom fragment, y(r), and letting 
the unsolved portion be A((r), 


Br) = ulr) + ABQ). (7.34) 
If F, Fy and AF are the Fourier transforms of 3, 34 and AZ, so that 


i oe eee 


Fy(h, k,l) = vf f [batuv.w) el2a(hut ku tlw) dy dudw 


00 0 
and so on, then 


F = Fy + AF (7.35) 


for any set of Miller indices. Of these three complex quantities, Fy 
can be calculated from y(r), the modulus of F can be estimated 
from the diffraction data and the expected value of |AF|* can be 
ascertained from the nature of the unlocated atoms (Wilson 1942): 


IF|=Fo+o and (|AFI?) = é?. 


If the fraction of scattering material in y(r) is sufficiently large, so 
that |Fy| > £ (say), then the argument of Fy, ¢u, will provide a good 
approximation to the phase of F: 


F = Fo exp(ign). (7.36) 


The origin and reliability of this assignment can be understood from 
Fig. 7.6, which shows the constraint imposed on the value of F by 
a noisy measurement of its modulus-squared and information about 
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Fig. 7.6 The likelihood constraint on the value of a structure factor given a noisy 
measurement of its modulus-squared and a ‘known’ heavy-atom fragment. 


the heavy-atom fragment. As discussed in Section 2.6.3, and illus- 
trated in Fig. 2.23, the former restricts F to lie within a ring in the 
Argand plane. A knowledge of 6u (r) and the average characteristics 
of AG(r), on the other hand, suggests that the most likely value of F 
is Fy, but that it could deviate from this isotropically, with decreas- 
ing probability, over a length scale of €. As shown in Fig. 7.6, the 
combination of these two constraints gives an indication of the likely 
phase of the structure factor. For reflections with well-determined 
intensities (ø < £), the reliability of y as the phase of F is found to 
vary monotonically with 


Fo |F] 
¿2 
This ‘figure-of-merit’ was first noted by Woolfson (1956) and Sim 


(1960), and a modern example of its use can be found in Sivia and 
David (2001). 


FOM = 


7.3.3  Isomorphous replacement 


Through the selective addition or replacement of certain atoms in a 
crystal, it may be possible to produce two or more samples that differ 
from each other in a well-defined and structurally known manner. 
Denoting this calibrated change by A{(r), the original (or native) 
SLD, (r), and its isomorphous variant, (31(r), are related by 


B(r) = bilr) + AB(r) . (7.37) 


This is analogous to eqn (7.34), and its Fourier transform is similar 
to eqn (7.35): 


P= FAP: (7.38) 
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Fig. 7.7 (a) The likelihood constraint on the value of a structure factor, F, given 
additional diffraction data from (a) an isomorphous sample and (b) two such variants. 


for any set of Miller indices. Unlike the heavy-atom case, however, 
AF can be calculated, 


L i l 
AF(h, k,l) = v | | [Astuew ganut kutlu) qu dyudw, 
000 


and both |F| and |F | estimated from diffraction measurements on 
the native and isomorphous samples. The corresponding constraint 
imposed on the phase of F is illustrated in Fig. 7.7(a). Although the 
angular uncertainty is a fraction of the original 27 radians, there are 
two distinct possibilities. This ambiguity can be broken through the 
acquisition of scattering data from a further isomorphous variant, at 
least in principle, as shown in Fig. 7.7(b). The practical problem is 
that the derived samples are not usually as isomorphic as hoped: the 
addition of a heavy atom, for example, tends to distort the binding 
site in the native structure. 


7.3.4 Direct methods and prior knowledge 


Without the benefit of a heavy atom, or several sets of related data, 
the successful solution to the phase problem depends on the use of 
suitable prior knowledge. While many different (r) can give agree- 
ment with the diffraction intensities, {Inxı}, relatively few (ideally 
just one) will be consistent with the chemical content of the sample 
and have the expected elemental bond lengths and angles. This in- 
formation, and its implicit restriction on the structure factor phases, 
can be encoded through an appropriate analytical model for the SLD. 
In eqn (6.1), for example, 3(r) is described as the sum of N atoms 
of known type, and the structure is correspondingly defined by their 
locations. In the case of biological macromolecules, the connectivity 
of rigid molecular building blocks can often be exploited to express 
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G(r) in terms of a modest number of parameters. Despite such effi- 
ciency, finding a set of model variables that yield agreement with the 
scattering measurements, in the absence of a good initial estimate, 
is a computationally difficult task. 

Until the advent of silicon chips, a highly nonlinear optimization 
of the type involved above was not feasible. All that could reason- 
ably be calculated was a direct Fourier inverse of the data, but this 
required the explicit assignment of phases to the experimentally de- 
rived structure factor amplitudes: 


1 p . 
(u,v, w) X y DODD | Faki exp(i brki) e l2n(hutkut+lw) | 
h k l 


The crux of the problem, of course, is choosing the a 1! Theoretical 
progress on this issue was made by Karle and Hauptman (1950), for 
which they received the Nobel Prize in 1985. Starting with arbitrary 
values for two or three strong reflections, which fixes the origin of 
the unit cell, they were able to make predictions about other phases 
based on relationships that the structure factors had to obey since 
the electron density, G(r) for X-rays, could not be negative. The anal- 
ysis is complicated because the simple constraint of positivity in real 
space translates into a non-obvious set of inequalities in reciprocal 
space. The phase extension procedure is not definitive, however, 
and so a number of alternative assignments are usually explored. 
The ultimate test on the utility of this approach rests with the in- 
terpretation of the resulting SLD in terms of an atomic structure 
that makes chemical and physical sense. If it does, then the derived 
parameters can serve as a good point of entry for a model-based in- 
ference. Even when the ((r) obtained in this manner is far from 
perfect, there may be enough clues in the noisy reconstruction to en- 
able part of the structure to be solved; this information can then be 
used in the heavy-atom sense. 


7.4 Powdered samples 


The primary hurdle in crystallography is the actual growing of the 
sample. Producing a single crystal of sufficiently large size to yield 
good diffraction data is often time-consuming and a challenge, es- 
pecially for neutron work. For this reason, scattering experiments 
are also conducted on powdered samples consisting of a very large 
ensemble of small randomly oriented crystallites. The lack of align- 
ment means that, as in Fig. 5.7, the resulting differential cross- 
section is spherically symmetric: 


(E) = Sa(@) = $aQ). 


where Q = |Q|. In terms of the geometry of Fig. 3.3, therefore, the 
diffraction pattern is invariant with respect to the azimuthal angle 


(a) 


I (b) 
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Fig. 7.8 (a) A Debye—Scherrer cone for a powdered sample. (b) Part of the diffraction 
pattern from a zeolite for monochromatic X-rays. 


o because the modulus of Q depends only on 0 and the wavelength, 
Aà. This leads to conical trajectories of equal scattering, as illustrated 
in Fig. 7.8(a), or circular rings of uniform intensity in a plane per- 
pendicular to the incident beam. 

An example of a powder pattern is shown in Fig. 7.8(b): it is a 
plot of the scattered intensity, J, as a function of 26 for X-rays of a 
fixed wavelength. The peaks in the signal occur when the spacing 
between the atomic planes in the sample, d, and the angle @ satisfy 
Bragg’s law of eqn (7.15). In terms of the reciprocal lattice formula- 
tion of eqn (7.6), this happens when 


An sin 8 
A 
Since this condition may be satisfied by more than one set of Miller 
indices, it leads to the problem that the area under a Bragg peak, 
I(sin@/X), can correspond to a linear mixture of the structure factor 
intensities of several reflections: 


I (sin 6/2) x ` |Enkil , 


(7.39) 


= |kA+kB+1C|. (7.39) 


where the summation is over the hkl that obey eqn (7.39). The 
potential for such overlaps is easiest to see for a cubic unit cell, of 
side a, when eqn (7.39) reduces to the Pythagorean requirement 


a 


h2 k? 12 = 
FR ( . 


Simple coincidences, such as those due to a permutation of h, k 
and l above, pose no difficulty if the space group symmetry ensures 
the equality of the related | F),x:|; the number of the latter is called 
the multiplicity of the reflection. The entanglement of the (005) set 
with (034) is a more troublesome cubic case, and representative of 
the diffraction information lost in having a powdered sample rather 
than a single crystal. 

Peak overlap also occurs for reflections whose Qs are sufficiently 
close, though not identical, due to their non-zero widths; this can 
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> Qr 


be seen from the expanded inset in Fig. 7.8(b). The severity of the 
problem increases with Q because the volume of reciprocal space 
enclosed within a thin spherical shell of radius |Q| is proportional 
to Q?; the Q-dependence of the peak widths tends to exacerbate the 
effect. The first step in the analysis of a powder pattern involves an 
examination of the low-angle Bragg peaks, with a view to assigning 
hkl-indices to the isolated low-order reflections and ascertaining the 
lattice constants; systematic absences give an indication of the space 
group. 

The spread and shape of the Bragg peaks results from a combi- 
nation of instrumental factors, such as the source—sample—detector 
distances and the collimation angles, and the intrinsic properties of 
the sample. The size of the crystallites makes a contribution, for 
example, as the situation is akin to diffraction from a finite three- 
dimensional grating; the one-dimensional analogue is illustrated in 
Fig. 2.17. Residual stresses in the sample also smear out the peaks 
as the X-rays, or neutrons, see a distribution of d-spacing relative to 
the unstrained material. 

Before moving on, we should note two other general features that 
can be seen in Fig. 7.8(b). Firstly, there is some scattered intensity 
between the Bragg peaks. This slowly varying signal can arise from 
several factors such as multiple scattering, poor shielding, incoher- 
ent (neutron) scattering and various types of aperiodic imperfections 
in the sample known as disorder. This background intensity must 
be subtracted, or accounted for, in order to carry out the crystalline 
analysis. Secondly, the Bragg peaks seem to diminish in magnitude 
with increasing 9. This is to be expected given the Q-dependence of 
X-ray form factors and the temperature-related Debye—Waller term, 
discussed in Sections 3.2.2 and 3.3.2. 


7.4.1 Texture 


The central assumption in powder diffraction is that of a very large 
number of randomly oriented crystallites. This starts to break down 
if there are too few of them, or if they are aligned to some degree. 
The latter is called texture or preferred orientation. A single crystal 
can be regarded as one extreme and an ideal powder as the other. 
Texture analysis is of interest in the intermediate regime found in 
worked materials and geological samples. 

Texture can be quantified through an orientational distribution 
function, or ODF, which is usually specified in terms of Euler angles: 


ODF = ODF(®,6,W), 


where 0 < ® < 360°, 0 < © < 180° and 0 < Y < 360° define the orien- 
tation of one set of Cartesian axes, x y z, taken to be aligned with a 
crystallite, relative to another, XY Z, assumed to be fixed within the 
sample as a whole. In essence, ® and O correspond to the longitude 
and colatitude of the z-axis with respect to XYZ, with Z as north 


and X the prime meridian, and W allows for a rotation of the 7-y 
plane about z. The fraction of crystallites with their Euler angles 
lying between ® and 6+d®, © and 0+d0, WV and V+ dW is given by 
the value of the ODF at (®,0,W) times sinO d dO dY, where sinO 
is the relevant Jacobian. If S,.)(Q|®, 0, Y) is the scattering function 
of the crystal structure that has been rotated through ®, © and Y, 
then the differential cross-section is its average over the ODF: 


(5) x J|] saQle, ow) ODF(®, 0, Y) sinO dé dO dY. 
el 


In texture work, (ccu(r) is typically the known structure of a sim- 
ple metal or mineral. This means that S.)(Q|®, 0, Y) can be calcu- 
lated, and the analysis task is one of trying to infer the ODF from 
the diffraction measurements. 


7.4.2 Twinning 


In single crystal work, a systematic departure from a 6-function ODF 
sometimes occurs when the internal symmetry of the unit cell is 
lower than that of its external geometry. The situation is illustrated 
schematically in Fig. 7.9, and is known as twinning. The differential 
cross-section in this case is a discrete sum of the two, or more, single 
crystal components comprising the sample: 


few 
do 
—) = X n Sea(Q\®;,0;,¥) , 
dQ). = 


where the domain coefficients {7,;}, and the crystal structure (.cu(r), 
are estimated from the diffraction data while the Euler angles are 
calculated for an assumed twinning symmetry. 

A related problem involves the analysis of chiral structures when 
the sample contains enantiomerically pure regions of both left- and 
right-handedness. The coefficients niert and Nrigat must then be com- 
bined with S.)(Q|left) and Sa (Q |right) to model the differential cross- 
section for a given beenlr). 


= 


E 
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Fig. 7.9 A schematic illustration of (4-fold) twinning, whereby a sample consists of 
an uncorrelated mix of several types of symmetry-related single crystal domains. 
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7.4.3 Fibre diffraction 


Some of the earliest molecular compounds to have their structures 
solved were biological fibres; of these, the DNA work of Watson and 
Crick (1953) is the most famous. Although the details depend on the 
situation, diffraction from a fibrous sample is akin to the highly tex- 
tured powder case where the long axis of all the crystallites is lined 
up in a given direction. Rotational averaging about this unique axis, 
z say, means that the differential cross-section is two-dimensional: 


do 
Ga x Dell Gry Q) 5 


where Q? =Q2+Q;. While this is more informative than a powder 
pattern, Sa(Q), it is not as unambiguous as the three-dimensional 
data from a single crystal. As such, fibre diffraction has largely be- 
come a niche technique. 


7.5 Magnetic structures 


Apart from a brief mention in Sections 1.4.3 and 3.3.4, and a length- 
ier discussion in Section 3.2.3, we have implicitly taken the samples 
to be non-magnetic. This is inconsequential for all but a few cutting- 
edge X-ray experiments, but is a significant omission for neutrons. 
Since a full theoretical treatment of magnetic scattering rapidly be- 
comes very technical, we will restrict ourselves to a consideration of 
some basic issues. 

One of the reasons for the complexity of this topic is the vecto- 
rial nature of magnetic moments. Unlike nuclear scattering lengths, 
they have both a magnitude and a direction. The analysis simplifies 
considerably when the moments can only be in one of two directions, 
‘up’ or ‘down’, for they can then be treated like scalars: they’re just 
positive or negative. Three crystalline structures of this type, corre- 
sponding to a binary compound, are illustrated in Fig. 7.10. With all 
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Fig. 7.10 Two-dimensional examples of (a) a ferromagnet, (b) an antiferromagnet 
and (c) a ferrimagnet. The coloured discs represent the nuclear cross-sections and 
the arrows indicate the magnetic moments. 


io 


its magnetic moments aligned, the first is a ferromagnet; the second, 
with alternating directions and no net moment, is an example of an 
antiferromagnet; the third, where the up and down contributions do 
not cancel completely, is a ferrimagnet. 

Scattering from a repetitive pattern of moments leads to Bragg 
peaks in the differential cross-section, just as in the non-magnetic 
case. The nuclear and magnetic signals will overlap for (a) and (c) in 
Fig. 7.10, due to their shared unit cell, but there will be additional 
magnetic intensity at intervening points in (b) because the period 
of the moments is twice as large as that of the nuclear structure. 
When the temperature of the sample is raised sufficiently to destroy 
the ordering of the moments, thereby turning the material into a 
paramagnet, the magnetic portion of the scattering transitions into 
a slowly varying incoherent background. At very low temperatures, 
when the moments are ordered and the Debye—Waller term can be 
neglected, the magnetic peaks diminish with Q whereas the nuclear 
contribution continues out to high values. 

Neutron diffraction data from a powdered inorganic compound are 
shown in Fig. 7.11 for two different temperatures: 5K in blue and 
295 K in grey. The peaks in the difference between them, which are 
shaded in light blue in the expanded inset, represent the Bragg sig- 
nal from the magnetic structure. One of them coincides with a peak 
at room temperature, which is entirely from the nuclear structure, 
but the other does not; the ‘new’ intensity at 20 ~ 26° indicates that 
the sample is antiferromagnetic. The magnetic signal decays rapidly 
to zero at higher angles due to the associated form factor. The two 
sets of data are virtually identical beyond 26 ~ 30°, therefore, apart 
from a slight shift in the Bragg positions due to the thermal expan- 
sion of the sample. 

Magnetic moments can form repetitive patterns even when they 
are not restricted to lie in only one of two directions. If their period 
is a rational fraction of the size of the nuclear unit cell, such as the 
2:1 in Fig. 7.10(b), then the two crystalline structures are said to be 
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Fig. 7.11 Part of the low-angle neutron data from a powdered sample of the antifer- 
romagnet Co1.2 Feo.s Geo.5 Gao.5 Mos N, taken on the D2b diffractometer at the ILL 
with \=2.4A, at a temperature of 5K (blue) and 295 K (grey). (Sviridov et. al., 2010) 
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commensurate; otherwise, they are incommensurate. The greater 
the complexity of the situation, the more the scattering experiments 
benefit from the use of polarized neutron beams and a single crys- 
tal sample. As the associated generalizations and analyses quickly 
lead into territory beyond the scope of this elementary text, the in- 
terested reader is referred to Bacon (1955), Squires (1978), Lovesey 
(1986), Lovesey and Collins (1996), and so on, for a more advanced 
and comprehensive treatment of the subject. 


Part III 


Inelastic scattering 


Energy exchange and 
dynamical information 


Having gained experience with elastic scattering, we now move on 
to consider the general case where there is an exchange of energy as 
well as momentum. 


8.1 Experimental considerations 
The first complication to arise when going from elastic to inelastic 


scattering is the need for two wavelengths, à; and Aș for the incident 
and outgoing beams respectively, because 


27 2m 
k;| = — # — = |ky|. 8.1 


The X-ray, or neutron, loses energy if A; < Az and gains it if à; > Ar. 
The wavevector diagram is shown in Fig. 8.1. Unlike in Fig. 3.2, the 


Q= kik (8.2) 


triangle is not isosceles. Consequently, the moduli of these vectors 
and the scattering angle have to be related through the more general 


formula: 
Q? = k? + kf — 2k ky cos26 |, (8.3) 


w= 
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Fig. 8.1 The wavevector diagram for inelastic scattering through an angle of 20. 
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which reduces to eqn (3.7) in the elastic limit of eqn (3.6). Adopting 
the geometrical setup in Fig. 3.3, the vectorial content eqn (8.2) is 
encapsulated by 


2 
Q= aui (- sin 20 cos ġ , — sin 20 sin ġ, an — cos 26) , (8.4) 
Af Ai 


which can be shown to satisfy eqn (8.3), as required, and to simplify 
to eqn (3.8) when à; = Ay. 


For a photon, the transition from A; to Az is associated with a 
frequency change of 


1 1 
w= 2re( 5-5) = w-ar, (8.5) 
where c is the speed of light, and a related transfer of energy 
E = ħw = E; -— Eş (8.6) 


to the sample. The corresponding expression for a neutron is derived 
from a consideration of its kinetic energy, (hk)? /2m,. In terms of the 
w in eqn (8.6), it takes the form 


thf 1 1 


The scattered particle loses energy if E >0 and gains it if E <0; the 
converse applies to the sample. 


8.1.1 The partial differential cross-section 


In Section 3.1, the differential cross-section, da/dQ, was introduced 
as the key object that summarized a scattering experiment. It was 
also noted that these (20, /)-measurements could be translated into 
the more theoretical notion of a scattering law, S(Q), most easily if 
all the interactions were taken to be elastic and the incident beam 
was monochromatic. Assuming a unique 4,, for simplicity, the idea 
can be extended to the inelastic case through the partial differential 
cross-section: 


Po _ " da/dQ for E; between E’ and E'+ AE’ 
dQ dE’ 7 AE’—0 AE! 


(8.8) 


where, as indicated earlier, Aç is related to Ey differently for X-rays 
and neutrons. In this formulation, the differential cross-section is 
given by the integral of eqn (8.8) over all possible final energies, 


do T d2a s 
a lua dE’, (8.9) 
0 
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whereas the elastic component, which dominates the scattering, is 
just the value of the partial differential cross-section when there is 
no exchange of energy, 


do E deg 
dQ),  dQdE’ 


Likewise, the cross-section, of Section 3.2.4, is equal to the integral 
of da/dQ over all directions: 


(8.10) 


E'= E; 


T 2T 
d 
o (Aj) = J A dQ, (8.11) 
20=0 ¢=0 


where the element of solid angle dQ = sin 26 d26 dọ. 

While the potential dependence of o on à; has been made explicit, 
it is implicit in eqns (8.8) and (8.9). Indeed, the partial differential 
cross-section is additionally a function of 20, ¢ and \;, which is why 
it can be associated with S(Q,w) through eqns (8.4)-(8.7). In terms 
of this scattering law, 


Wi 


S(Q) = [sae dw and S.a(Q) = 8(Q,w=0), (8.12) 


—_— co 


where the incident particle can transfer no more than its initial en- 
ergy, ħw;, to the sample. 


8.1.2 Triple-axis spectrometer 


Following the work of Brockhouse (1955), inelastic experiments are 
traditionally carried out with a triple-axis spectrometer. Its main 
components, excluding the collimation hardware, are illustrated in 
Fig. 8.2. It differs from an elastic, or double-axis, setup through the 
addition of an analyser stage, since the final and incident energies 
are no longer assumed to be the same. The analyser is simply a 
monochromator that intercepts the scattered beam. Both selectively 


et Ay 
YS) > | Detector 
An 


r a 
n Ai 


Fig. 8.2 A schematic illustration of a triple-axis spectrometer. 
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reflect particles of a given wavelength based on the Bragg condition 
satisfied by suitably oriented crystals, as discussed in Section 7.1.2. 
An appropriate choice of \;, Ay, 20 and ¢ allows the desired value of 
Q and w to be accessed. 


8.1.3 Time-of-flight instruments 


Although a triple-axis spectrometer is very versatile, each monochro- 
mation stage takes a heavy toll on the particle flux. One of the two 
can be avoided naturally at a spallation neutron source by using the 
time-of-flight technique. The method was outlined for elastic scat- 
tering in Section 3.5.1, but the principle is the same in the inelastic 
case. If Lı is the length of the flight path from the moderator to the 
sample, and Lə is the distance from the sample to the detector, then 
the neutron will take a time 


Mn 


t 
h 


(Ly Ai + Lo Af) (8.13) 
to travel from the source to the detector. This reduces to eqn (3.52) 
when à; = Af and L = Lı + L2. With t = 0 set by the proton pulse 
of the accelerator, eqn (8.13) enables Aş to be calculated from a 
measurement of the neutron’s arrival time if à; is known; similarly, 
A; = Ai(t, Af). There is no need to have both a monochromator and 
an analyser, therefore, as either will suffice. 

Time-of-flight setups where the incident wavelength is fixed are 
known as direct geometry instruments; they often employ mechan- 
ical choppers to select neutrons moving with a particular velocity. 
Those with a polychromatic incident beam and energy analysers for 
the scattered neutrons are said to have an indirect geometry. As a 
function of time, t, at a given 20 and ¢, such instruments explore 
(Q,w)-space along one-dimensional trajectories; this is illustrated 
in Fig. 8.3 for |Q| and EF. 
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Fig. 8.3 The energy and momentum transfer accessed by various 20 detectors in a 
time-of-flight instrument with (a) direct geometry (E; = 130 meV, Lı =12m, Lə =4m) 
and (b) indirect geometry (Eş =2meV, Lı =36m, Lə =1.5m). 
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8.2 Scattering from time-varying structures 


In Chapter 3, we first considered scattering from a single fixed atom 
and then moved on to an assembly of such entities. It was asserted 
that the time-invariant nature of the associated SLD, (G(r), guaran- 
teed that the scattering interactions were elastic. A deeper insight 
into this claim emerges when we remove the temporal restriction, 
so that 8 = (r,t), and generalize the analysis of Section 3.3 to the 
inelastic case. 

With 6= (r,t), the time dependence of the incident and scattered 
waves must be made explicit. As such, eqn (3.12) becomes 


Wi = Wo el (kez — wit) | 


where y, is a complex constant related to the incoming flux, &. Sim- ® x |ypol? 
ilarly, eqn (3.29) takes the form 


n i[kye(r’—r) — wrt] 
dyp = vy ellkier— 4) [B(x t) dr dt a (8.14) 
r-r 
for the isotropic contribution to the outgoing wave at r’ made by the 
SLD within an infinitesimally small volume centred on r between t 
and t+ dt. In the far-field limit, when |r’—r| — |r’| = 1’, the integral 
of eqn (8.14) over the sample and time gives 


ikyer’ : 
we = ne JJJ o t) el (Qer—wt) dêr dt, 
r 
V T 


where w and AQ are the energy and momentum transfers in the 
process. Since the probability that an incident particle will be scat- 


tered through 20 and ¢ with a wavelength of Aç is proportional to 6Q = “5 
we I”, the partial differential cross-section is given by 
2 
d*o Cf i(Qer—wt) 43 
MA er wW .1 
aar % A If]fos d°’r dt f (8.15) 
VT 


where E’= fw, and 


Ai /Az for neutrons, 
(8.16) 


1 for photons. 


The ratio of final-to-initial speeds, c;/c;, enters because the incident 
and scattered flux depend in part on how fast the particles are mov- 
ing. Being unity for both neutrons and X-rays in the elastic case, 
this term was omitted earlier for simplicity. In essence, the partial 
differential cross-section is the modulus-squared of the space—time 
Fourier transform of (r,t); the latter is formally S(Q, w). 
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8.2.1 Elastic scattering 


Having derived eqn (8.15), we can confirm that a static SLD does 
indeed give rise to purely elastic scattering. If G(r,t) = G(r), then 
the four-dimensional Fourier transform separates into a product of 
spatial and temporal terms, 


MEG elQer gy fra. 
V T 


When integrated over a sufficiently long time, the latter is approxi- 
mated well by a 6-function: 


CO 


fat = 2r ôð(w). 


— Co 


As this is non-zero only for w =0, or E’ = E;, the interactions must 
be entirely elastic. 

Equation (8.15) also helps us to understand that the elastic com- 
ponent of the scattering tells us about the time-averaged structure 
of the sample. To see this, we just need to substitute w=0 and carry 
out the t-integral first: 


2 


do 1Qer 3 

dn dB" x J| [pede dèr] , (8.17) 
E'=E; =) 

where g 
(Bimi = - / G(r, t) dt. (8.18) 


Information about the dynamical aspects of the SLD is contained in 
the inelastic part of the scattering function. 


8.2.2 Space-time correlation function 


The presence of the modulus-squared in eqn (8.15) makes it very 
difficult for 3(r,t) to be inferred from the partial differential cross- 
section, in general. As noted on several occasions, however, the un- 
derlying phase problem is avoided if we work instead with the ACF, 


G(r,t) = | {[foc.ty B+R, t+) d°R dt’, (8.19) 
Vo T 


because this is related linearly to the scattering function: 


2 
aa x * JJJ fae tj Orr) d rdt. (8.20) 
Ci 
V T 


S(Q, w) 
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Apart from being four-dimensional, G(r, t) is essentially no different 
from the ACFs met earlier: it gives us a measure of the correlation 
between pairs of points in the sample which are separated by a dis- 
placement r and a time t. Simultaneously occurring spatial associa- 
tions are encoded in G(r, 0), therefore, and temporal linkages at the 
same location are captured by G(0, t). If a structure is static, then 
G(r,t)=G(r). 

Scattering theory was first developed from the perspective of the 
space—time correlation function by Van Hove (1954). In terms of 
this, the results of Section 8.2.1 are 


q?’ iQer 43 
nan % sto) |] ec) e d°r (8.21) 
V 


for a static structure and 


(ia), * J JJ (G(r, 8), AO ae, (8.22) 
V 


with G replacing ( in eqn (8.18). 


8.2.3 Total scattering 


As noted in Section 3.1.3, most diffraction experiments do not have 
any energy transfer discrimination and rely on the preponderance 


of elastic scattering: 
do _ (do n do qae 
dQ 7 dQ el dQ a dQ el l 
Using the G(r,t) formulation, we are in a position to appreciate the 


structural significance of the quantity that is actually measured: the 
total differential cross-section. Substituting for d?a0/dQ dE’ from eqn 


(8.20) in eqn (8.9), 
z x f | | i: G(r, t) 2°" £(t) d rdt (8.23) 
VF 
where 
f(t) = I £ eiet qp’ (8.24) 


is the function resulting from the integral of everything that could 
depend on E’. With cs = c; for photons, and hw = E; — E’, 


f(t) = h f itaw x ôt). ħdw =-—dE’ 


— OO 
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Putting this 6-function into eqn (8.23), the t-integral becomes very 


easy and leads to 
ad x JJe 0) elQer sy (8.25) 
dQ ` 
V 


for X-rays. The same expression holds for neutrons, albeit with a dif- 
ferent proportionality constant, but showing it requires extra effort. 
It hinges on the fact that, to within a scaling factor, 


f(t) = n | [A ita 
Wi 


is also approximated well by a -function at t=0. 

The distinction between the elastic and total differential cross- 
sections follows from eqns (8.22) and (8.25): the former is related 
to the time-averaged structure whereas the latter pertains to an in- 
stantaneous view, or snap-shot, of it. The two will be identical, of 
course, if G(r,t)= G(r). While this might be a reasonable assump- 
tion for a solid, it’s far from being applicable for a liquid sample. 
In fact, liquids should produce no elastic scattering, other than at 
Q =0, because their time-averaged SLD is uniform. The analysis of 
such data depends crucially on the ‘static approximation’, whereby 
|w|<<w;, as the pseudo-elastic condition of \; +A, is necessary, in the 
absence of d?a0/dQ dE’, for transforming the measured do/dQ into 
the corresponding S(Q). 


8.2.4 Coherent and incoherent scattering 


In Section 3.3.3, we saw how the elastic differential cross-section for 
the nuclear scattering of neutrons could be expressed as the sum of 
a coherent and an incoherent contribution. A similar decomposition 
can be performed for inelastic scattering: 


d?o d?o d?o 
SS a : 2 
dQ dE” (x5 i). ý (= im). j o 


In Chapter 3, the coherent component arose from the part of the SLD 
that showed a discernible spatial pattern and the incoherence re- 
sulted from the random nature of what was left over. With the extra 
time dimension associated with inelastic scattering, (d?0/dQ dE") con 
is related to the correlated motion of the atoms in the sample while 
(d?a0/dQ dE’) incon has to do with their independent behaviour. Thus 
coherent scattering tells us about collective excitation modes, such 
as phonons and magnons, whereas incoherent scattering contains 
information on local dynamics, such as diffusion of single atoms. We 
will see examples of both in Chapter 9. 
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8.3 A quantum transitions approach 


Other than invoking wave—particle duality for the neutron and X- 
ray probes, through use of the de Broglie and Planck hypotheses, 
and appealing to some basic spin arithmetic when necessary, our 
analysis has been based entirely on classical physics. We have not 
taken into account the quantum properties of the sample itself in 
any meaningful way. The remedy for this shortcoming rests on a 
reformulation of the scattering problem in terms of transitions be- 
tween discrete initial and final states of the particle-sample system 
as a whole. Since a proper treatment of the subject rapidly becomes 
quite daunting, even within the usual approximation of ‘first-order 
time-dependent perturbation theory’, we will merely highlight a few 
key points. Details can be found in more advanced texts on scatter- 
ing theory, such as Lovesey (1986), and quantum mechanics books, 
such as Baym (1969). 

As before, the incident and outgoing energy and momentum of the 
scattered particle is determined by the wavevectors k; and ky. The 
difference now is that the corresponding initial and final state of the 
sample, indexed by 7) and n’ respectively, must also be specified to 
define the scattering event completely. Using Dirac’s compact no- 
tation, and ignoring issues of spin and polarization for simplicity, 
transitions from the configuration |k;,7) to |ky, 7’) will occur at a 


rate of 
27 j 2 
Rkin>krn' = i kon |V)ki,7) ) (8.27) 
subject to the conservation of energy constraint that 
h 
[hw + By —En| $ > (8.28) Ww = wi — Wy 


where V(r) is the perturbative Fermi pseudopotential, which acts for 
a time At, representing the interaction between the sample and the 
X-ray or neutron. The number of transitions will also depend on 
the probability of finding the the sample in the initial state 7) F 
Assuming a thermal equilibrium at temperature T, the latter will 
follow a Boltzmann distribution: 


= prob{|n)|T) ox exp (- =) (8.29) 


As scattering measurements relate only to changes in the wavevec- 
tor of the probe, the relevant transition rate is given by a weighted 
summation over all the possible sample states: 


Resi =) Py dS) Rian kp mn’ 5 (hw+Eq— En’) , (8.30) 
n n’ 


where the constraint of eqn (8.28) has been replaced by a 6-function 
on the understanding that h/At %0. 
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dk; 


The partial differential cross-section requires an evaluation of the 
rate, R(20,¢,E’), at which particles are scattered into a tiny solid 
angle, dQ, at (20, ġ) with Ey between E’ and E’+ dE’. This entails a 
summation of eqn (8.30) over the small volume of phase space, 

ak ddd = k2a0 “LB (8.31) 
f f f f JE; f > . 
which is consistent with the given direction and final energy range. 
Assuming that R k; — k; does not change abruptly with ky, a uniform 
continuum of possibilities for the outgoing wavevector implies that 


R(20,ġ, E') x Rk, >w dk’. (8.32) 
Substituting from eqns (8.31) and (8.32) into the basic definition 


d's ——-R(20,¢, E’) 
dQdE’ NG®dQdE’’ 


where the incident flux ®«c; and the normalizing N is the number 
of scattering units in the sample, and using the fact that 


for both X-rays and neutrons, we find that 


d?o k’? 
oar a E en 
where Rx, —w follows from eqns (8.27) and (8.30). This is the usual 
starting point of more erudite discussions on scattering theory, but 
we will not pursue this approach further here. 


Examples of inelastic 
scaitering 


In this final chapter, we consider several examples of inelastic scat- 
tering. Our aim here is not to be comprehensive, or too theoretical, 
but to gain an appreciation of the types of dynamical behaviour that 
can be probed with X-rays and neutrons. 


9.1 Compton scattering 


The earliest example of inelastic scattering at the atomic level dates 
back to 1922, with the work of Compton on the scattering of X-rays 
by a thin foil of graphite. He found that the signal at any given scat- 
tering angle, 20, contained two wavelengths: one was equal to that 
of the incident monochromatic beam, \;, as expected from classical 
radiation theory, but the other was longer. The wavelength shift of 
the latter, 6\ = A29 — à;, depended on 26 according to 


dA = (1 — cos 20) , (9.1) 


Wise 


and the process was associated with the emission of electrons. Comp- 
ton explained this component of the scattering by considering it to 
be the result of a billiard ball type of collision between a photon and 
a stationary electron, as illustrated in Fig. 9.1. 


Before After 


Fig. 9.1 The kinematical picture used by Compton to explain the unexpected change 
in the wavelength of some X-rays upon scattering from an electron. The relativistic 
correction is encoded through the introduction of the Lorentz factor, y. 
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The historical importance of Compton’s work was that it provided 
support for a particle picture of photons at a time when the notion 
was still controversial: they could be associated with a well-defined 
momentum, h/\, and energy, hc/\, and interact with an individual 
electron. While Compton’s analysis was based on a collision with a 
stationary electron, a relaxation of this condition shows that infor- 
mation on the momenta of the electrons can be obtained from such 
experiments. Since the X-ray case is complicated by the need to use 
relativistic mechanics, as significant Compton scattering only occurs 
at incident energies comparable to mec’, let us consider instead the 
collision between a neutron and a nucleus; the physical principles 
are the same. 

If the momentum of a nucleus of mass M before the collision is p;, 
and that after the impact by the neutron is pp, then this two-body 
interaction must satisfy 


Pf — Pi = ki- ks = AQ, (9.2) 


from the conservation of momentum, where k; and ky are the usual 
incident and final wavevectors of the neutron, and 
Ips? — |pal” Aki? Akg” 
_ = — = 2h 9.3 
M M Mn Mn oe ae 
from the conservation of energy (in the classical limit). Substituting 
for ps from eqn (9.2) into eqn (9.3), 


|p|? = preps = (AQ+p:) (AQ +p:), 
we obtain 


h|Q\? 
=L. 


Q.p; = Mw - (9.4) 


Hence the strength of the scattered neutron signal, with an energy 
and momentum transfer of iw and AQ, will be related to the num- 
ber of nuclei with a p; that satisfy eqn (9.4). If all the nuclei were 
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Fig. 9.2 Neutron data from polythene, at a scattering angle of 20 = 50°, taken on the 
VESUVIO spectrometer at the ISIS facility. The expected signal from stationary nuclei 
of hydrogen and carbon is indicated with a dotted line. (Courtesy of Dr. J. Mayers.) 


stationary, so that p; = 0, then there would be a sharp peak in the 
energy transfer spectrum whenever 


E = ħw = À (9.5) 


For a fixed |Q| = Q, this means there will be as many peaks as nu- 
clear species (masses) in the sample. They will be smeared out some- 
what due to the instrumental resolution function, of course, but any 
additional broadening would be indicative of a spread in the initial 
momenta. An example of such data, from a sample containing only 
carbon and hydrogen, are shown in Fig. 9.2. 

Focusing on the scattering from a single nuclear species, typically 
hydrogen, the preceding discussion can be summarized mathemati- 


cally by 
2 
su(Q.w) x ff fa) ife = a 22] dp, (9.6) 


where n(p) is the momentum distribution of the atoms of mass M 
within the sample and the value of the 6-function is nought unless 
its argument is equal to zero. For the stationary case, n(p)= ô (p). 
Given the central requirement of eqn (9.4) for observing a Compton 
signal, the related scattering function is often expressed in terms of 
the unit vector Q=Q/Q and the component of p in that direction, 


~ M ho? 
y= pĝ = Z (0-2), (9.7) 
instead of Q and w. Formally, 
A h 
Iu (Q, y) = tg Sm (Q, w) . (9.8) 


The advantage of this change of variables, or “y-scaling’, is that the 
Compton profile becomes one-dimensional when n(p) = n(|p}|): 


Ju (Q,y) = Ju(y) 


because the scattering cannot depend on the direction of Q when the 
momentum distribution is isotropic. 


9.1.1 The impulse approximation 


Perhaps the strangest thing about Compton scattering is not that 
the incident X-ray or neutron can be treated like a particle, but that 
the entity struck behaves as though it were an isolated object rather 
than an integral part of the structure within which it resides. That 
is the assumption, at least, under which the analysis is carried out. 
It is based on the impulse approximation, whereby the collisional 
interaction takes place so quickly that the neighbouring atoms, or 
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electrons, don’t have time to respond to the changing situation. Its 
validity depends on the incident energy and momentum transferred 
being sufficiently high that the local binding energy is negligible and 
the related interaction time scale is much shorter than that of any 
process in the sample. 

The uncorrelated nature of the collision process above means that 
Compton scattering is incoherent. With reference to the neutron 
case and Section 3.2.4, the strength of the signal from a particular 
nuclear species then depends on its average cross-section, 47 (0? 
In terms of the definitions in eqns (3.26) and (3.27), this is actually 
the sum of the coherent and incoherent cross-sections; as such, it 
is usually called ciot. The much larger size of the hydrogen peak 
in Fig. 9.2 compared to that of carbon, even though their number 
ratio in polythene (—C H2—) is only two-to-one, is due to its enormous 
incoherent cross-section. For X-rays, the scattering length involved 
Ag = is the Compton wavelength, Ac, rather then the classical (Thomson) 
radius of the electron, re, of eqn (3.19). Their ratio is 


Te e? Q 1 


E = ae 9.9 
AC Ateghe 27 861’ a) 


where a is the fine structure constant. 


9.1.2 Single particle wave function 


The momentum distributions at the heart of Compton scattering are 
also linked to other quantities. The simplest connection is between 
an n(p) and the average kinetic energy of the associated particle. 
For nuclei of mass M, 


(K.E.) = a = za) Ip|? n(p)dëp, (9.10) 


where we have assumed that n(p) is correctly normalized so that 
its integral is equal to unity. For an isotropic distribution, this is 
related to the fourth moment of n(p) and, somewhat less obviously, 
the second moment of J (y). 

At a more fundamental level, the wave function, p, describing the 
quantum mechanical behaviour of a particle can be expressed in 
terms of its position, r, or its momentum p. The two are connected 
through a Fourier transform: 


p(p) = no? f fw) QP srl? gip, (9.11) 


where n(p) = |~)(p) |? and the spatial probability density is given by 
lY(r)|?. The relationship between n(p) and w(r), in turn, provides 
a link between Compton scattering and the potential, V(r), experi- 
enced by the associated particles, through Schrodinger’s equation. 
An interesting example can be found in Reiter et al. (2002). 


9.2 Lattice vibrations 


In sharp contrast to the incoherent nature of Compton scattering, 
X-rays and neutrons can also interact with the correlated motions of 
the atoms within a sample. They can create and annihilate collective 
vibrational modes, such as sounds waves, called phonons. While this 
behaviour does occur in liquids and amorphous solids, it is seen most 
clearly in crystalline materials. 

The simplest physical model that allows for an understanding of 
phonons consists of a long linear chain of atoms, all of mass m and 
separation d, connected by identical springs of stiffness a. This is 
illustrated in Fig. 9.3, with u,, denoting the displacement of the nt? 
atom from its equilibrium position. Applying Newton’s second law 
of motion (F = ma) to atom n, 


os d?un 

d 
where we have assumed that the displacements are small enough 
that Hooke’s law is obeyed and t is time. Following Sections 2.1 and 


2.2, the pattern of us will constitute a travelling wave, of frequency 
w and wavenumber k, if 


(9.12) 


a (Un+1— Un) — Q (Uun — Un—1) = 


u = Ap root, (9.13) 


for j =1,2,3,...,~10°3, where xj =jd and |A| «d. Substituting this 
into eqn (9.12), with j=n—1, n and n+1, it can be verified as a valid 
solution provided that 


w? = 2 sin") (9.14) 
m 2 


Taking the frequency to be positive, but allowing the wavenumber to 
be negative (for a ‘backwards’ travelling wave), the dispersion curve 
has the property 


2 
w(k)=w(-k) and w(k) = a(k). (9.15) 
This symmetry and periodicity means that w (k) is defined uniquely 


for O< k <a/d. The region |k| < a/d is known as the first Brillouin 
zone. 


Fig. 9.3 A one-dimensional model that exhibits the most basic phonon behaviour. 


-n/d 
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If the one-dimensional ball-and-spring model is modified to have 
two different masses, m and M, alternating along the chain, then it 
leads to an interesting new behaviour also characteristic of phonons: 
there can be more than one type of lattice vibration, with its own fre- 
quency, for any given wavenumber. Inserting the atoms of mass M 
in between the existing ones in Fig. 9.3, to maintain the periodicity 
over d, and denoting their displacements from equilibrium by v,, for 
j =1,2,3,...,~107%, the travelling wave 


Up = Aecl(kti—“t) and vj = Belki- ot), (9.16) 


where x; = jd, can be shown to satisfy the equations of motion pro- 
vided that 

u? = o (bth) tay (Eti sit. eD 
Although this expression is more complicated than eqn (9.14), the 
two solutions, w4 (k) and w- (k), share the symmetries of eqn (9.15). 
The positive and negative branches of the dispersion curve are called 
the optical and acoustical modes, respectively. The origin of these 
terms is best understood by considering the nature of the associated 
vibrations in the limit of long wavelengths. For |k| «r/d, eqn (9.17) 
reduces to 

1 1 ak?d? 


wi x 2a (=+37) and we 3 (m+M) f (9.18) 


The corresponding ratio of amplitudes in eqn (9.16) is found to be 


$l a = and $ a 1, (9.19) 


from the equations of motion. Neighbouring atoms move in opposite 
directions for the positive solution, therefore, but in harmony for the 
negative one. If the two types of atoms carried equal and opposite 
charges, as for a binary ionic compound, the w, mode would result in 
a fluctuating electric field; since this can couple to ‘light’, it is known 
as the optical branch. The acoustical mode is so called because the 
atomic motions are similar to those in a sound wave: they give rise 
to alternating regions of compression and rarefaction but not optical 
activity, as there is no net separation of ionic charge. Being non- 
dispersive for \>>d, when 


Wo dw_ 


r ae A (9.20) 
acoustical phonons have a well-defined speed of propagation, c, at 
long wavelengths. 

For simplicity, the displacement of the atoms has so far been taken 
as collinear with the chain. As such, the discussion has been about 
longitudinal phonons. Vibrational modes in which the ujs and vjs 
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Fig. 9.4 Instantaneous atomic displacements, over a wavelength (A = 7d), for trans- 
verse phonons along a linear binary chain: (a) is for an acoustical mode, and (b) is its 
optical counterpart. 


are perpendicular to the direction of propagation, x, are said to be 
transverse phonons. They exhibit very similar behaviour in terms of 
their dispersion curves, including optical and acoustical character- 
istics; the latter are illustrated in Fig. 9.4. There are two transverse 
modes for every longitudinal excitation, since there are two linearly 
independent orientations that are perpendicular to the direction of 
propagation. 

While the one-dimensional model above may be crude, it captures 
many of the salient features of phonons. The main difference from 
real samples, which are three-dimensional, is that the dispersion 
curves become a function of the wavevector k. For a crystalline ma- 
terial with lattice vectors a, b and c, the symmetries of eqn (9.15) 
generalize to 


w(k)=w(-k) and w(k) =w(k+G), (9.21) 


where G is a linear integer combination of the reciprocal vectors A, 
B and C defined in eqn (7.7). 

Figure 9.5 shows the neutron scattering function, 5(Q, E), from a 
single crystal sample of copper selenide, for Q along the 110 direc- 
tion. The magnitude of the wavevector transfer is given in reciprocal 
lattice units, whereby the boundary of the first Brillouin zone is at 
+0.5. The greatest intensity, by far, is in the neighbourhood of the 
Bragg peak at (0,0), as indicated by the (logarithmic) blue shading. 
A weak plume-like signal, highlighted by the low-level contours, is 
also apparent and resembles the dispersion curve of an acoustical 
phonon. The latter arises because S(Q, E) records the creation and 
annihilation of phonons. That is to say, neutrons transfer an energy 
hw and momentum fk to the sample when a phonon of frequency 
w and wavevector k is generated; E and Q are negative to the same 
extent when it is destroyed. Taking into account the symmetry of 
eqn (9.21), the interaction is summarized by 


B = ce ite and Q=GHk, (9.22) 
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Fig. 9.5 (a) Neutron scattering data, S(Q, E) for momentum transfer along the 110 
direction, from the superionic conductor Cu1.8Se, taken on the TAIPAN spectrometer 
at ANSTO. (b) A plot of the scattered intensity, as a function of energy transfer, for 
Q =0.35 reciprocal lattice units. (Courtesy of Drs. S. A. Danilkin and M. Yethiraj.) 


where k is restricted to lie in the first Brillouin zone. 

Phonons have traditionally been studied with neutron scattering, 
since their energy at atomic wavelengths is comparable to that of 
lattice vibrations. The technical requirements to attain similar mea- 
surements with X-rays are much more demanding, as there is only 
a tiny fractional change in the energy and momentum of the pho- 
ton. The first example of a dispersion curve using X-rays was due to 
Dorner et al. (1987), some thirty years after the earliest work with 
neutrons. The sheer intensity of the latest generation of synchrotron 
sources, however, is starting to make X-rays a viable alternative for 
exploring lattice dynamics. 


9.2.1 Heat capacities 


The phonons studied microscopically with X-ray and neutron scat- 
tering also manifest themselves in certain macroscopic properties 
of the sample. We alluded to one of these in eqn (9.20), where the 
initial slope of the dispersion curve for an acoustical mode gives the 
‘speed of sound’ in the material. Another link occurs in heat capaci- 
ties. Following standard texts such as Blakemore (1985) and Kittel 
(1953), the phonon contribution to the internal energy, U, at a tem- 
perature T is given by 


7 T g(w) hw 
= lates —1 oe eee) 


where k, is the Boltzmann constant and g(w) dw is the number of 
lattice vibrational modes with a frequency between w and w+dw. 
The density of states, g(w), can either be inferred from heat capac- 
ity measurements, 0U/OT, for a series of different temperatures, or 
from scattering data. As T — 0, when the Bose-Einstein occupancy 
term and Debye—Waller factor can be ignored, g(w) reduces to the in- 
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Fig. 9.6 The density of phonon states in aluminium, g(w), estimated from neutron 
scattering measurements taken on the MARI spectrometer at the ISIS facility; the 
dotted line is the Debye model. (Courtesy of Prof. S. M. Bennington.) 


tegral of S(Q,w) with respect to Q over the phonon peaks. A density 
of states function derived from neutron scattering measurements is 
shown in Fig. 9.6. 


9.2.2 Spin waves 


If a crystalline sample is magnetic, then the spins of the unpaired 
electrons can oscillate in a correlated fashion analogous to phonons; 
the excitations are called spin waves or magnons. The phenomenon 
may be visualized by considering a simple one-dimensional model: 
a set of bar magnets that are spaced uniformly and fixed along the 
x-axis, but whose north and south poles are free to rotate in the 
yz-plane. In the lowest energy state, neighbouring magnets will 
be aligned antiparallel to each other. If the ‘left-most’ bar of this 
chain is rotated about the x-axis, then the others will follow in an 
attempt to restore the equilibrium of the antiferromagnetic order. 
A plot of the y- or z-component of the magnetic moments will be a 
sinusoidal function of x and time, similar to Fig. 9.4(b), resulting 
from an angular lag, A0, between successive bars. A dependence of 
A9 on the frequency of rotation, w, means that the x-distance over 
which the helical pattern of bar magnet orientations repeats itself, 
or the wavelength à, will satisfy a characteristic dispersion relation: 
w=w(k). 

As the frequency of rotation in the mechanical model above is re- 
duced, we would expect to become longer because neighbouring 
bar magnets will find it easier to keep up with each other. This in- 
tuition, that k — 0 as w — 0, is supported by a more formal analysis 
which predicts that ar, me 


k? for a ferromagnet , 
w 


x l (9.24) 7 O NOS o a oan 
Ik| for an antiferromagnet, E W/ y 
i 5 -ferromagnetic 
for long wavelength magnons, and has been confirmed by neutron | = 


scattering experiments. Due to the magnetic form factor of Section ae n/d 
3.2.3, the intensity of S(Q,w) along a spin wave dispersion curve 
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diminishes with |Q| even at low temperatures. As with all magnetic 
behaviour, magnons cease to exist above a critical temperature. 


9.3 Molecular spectroscopy 


As discussed in Section 1.3, dynamical information at the atomic 
and molecular level can be ascertained by a variety of spectroscopic 
techniques. X-rays extend the energy range that can be probed with 
optical and UV radiation, and enable the deep electronic structure 
of heavier elements to be studied. Thermal neutrons provide an al- 
ternative to IR, Raman and microwave methods for exploring molec- 
ular vibrations and rotations, as indicated by the similarity of the 
spectra in Fig. 9.7, but what benefit is there to using them? After 
all, access to neutron facilities is limited by comparison with IR and 
Raman instruments. 

Neutrons possess qualities that offer some advantages over tra- 
ditional electromagnetic techniques. One of these is the absence of 
selection rules for the observable transitions. As in the case of the 
optical mode in phonons, IR and Raman techniques are only sensi- 
tive to certain types of molecular motions; neutrons, on the other 
hand, can exchange energy and momentum with all kinds of ex- 
citations. This means that all modes are, in principle, observable 
by inelastic neutron scattering (INS) spectroscopy. In practice, the 
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Fig. 9.7 Part of the infrared (IR), Raman and inelastic neutron scattering (INS) 
spectra of the a-amino acid L-cysteine. (Parker and Haris, 2008) 


scattered intensity is dominated by modes that involve hydrogen 
displacement. The highly penetrating nature of neutrons is also an 
important property, especially when dealing with samples that are 
opaque to photons. It enables the study of adsorbed species on oxide 
and metal catalysts, for example, and hydrogen-in-metals. Another 
positive feature of neutrons is that the intensity of the scattered 
signal, S(Q,w), can be predicted fairly easily for a given molecular 
model. The corresponding calculation for IR and Raman work is 
much more difficult, and so agreement with the measurements is 
limited to a comparison of the normal mode frequencies. While the 
latter help to pin down the interatomic ‘spring constants’, an under- 
standing of the intensity of the signal allows for an inference of the 
amplitudes of the motions. 

Figure 9.8 shows a schematic plot of neutron scattering data from 
a molecular crystal, integrated over Q, for positive energy transfers. 
The signal for E <0 would be a mirror reflection with a diminishing 
intensity. This is due to the Boltzmann factor, whereby lower energy 
states in the sample are more likely to be occupied than higher ones. 
As a result, it is more probable that the neutron will lose energy in 
the encounter than to gain it; the ratio between the two possibilities 
is called detailed balance. 


9.3.1 Quasi-elastic scattering 


The most unusual feature in Fig. 9.8 is the broad line at E=0. Being 
coincident with the elastic component, it is known as quasi-elastic 
scattering. An insight into its origin can be gained by recalling the 
Fourier relationship between the time-dependent structure, ((r, t), 
and the scattering function, S(Q, w), of eqn (8.15). Whereas a lack of 
temporal variation leads to a sharp signal at w =0, and a sinusoidal 
oscillation at frequency wo yields peaks at +w., stochastic changes 
over a characteristic time-scale 7 give rise to a line centred on w=0 
with a width dependent on 1/7. The last assertion reduces to the 
elastic case in the limit of 7 — œ, as expected, but it may seem more 
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Fig. 9.8 A schematic plot of neutron scattering data from a hypothetical molecular 
crystal, S(Q, w) integrated over Q, based on Eckert (1992). 


9.3 Molecular spectroscopy 185 


186 Examples of inelastic scattering 


plausible given a Fourier result of the type 


2 
1 


J edt) = = 7; (9.25) 
j w? + (1/7) 


where the function on the right is called a Lorentzian. Leaving the 
details to Bée (1988), an analysis of quasi-elastic scattering provides 
information on the diffusive translational and reorientational mo- 
tion of atoms and molecules. 


9.3.2 Energy resolution and time-scales 


The central equations of Section 8.2, (8.15) and (8.20), indicate that 
there is a reciprocal Fourier relationship between w and t just as 
there is between Q and r. The result given in eqn (9.25) is an explicit 
example. The resolution with which the energy transfer E =ħw can 
be measured, AF, therefore has a bearing on the time-scale 7 of the 
dynamics that can be explored: 


1 h 
Aw AE’ 
While rapid motions can be inferred from low-resolution energy data, 
slower ones require a very small AF. In the quasi-elastic region, the 
highest energy resolution is achieved with the spin-echo technique 
(Mezei 1972) which makes use of the magnetic dipole of the neutron. 
With a AF as little as a few neV, time-scales of up to 100ns become 
accessible. 


(9.26) 


T ma 


Discrete Fourier transforms 


The discrete nature of computer technology means that the contin- 
uum formulation of Fourier transforms in Section 2.4 is best re- 
placed with a digitized version for numerical calculations. In the 
simplest case, this takes the form 


N-1 
F; = >, fj gA ae (A.1) 


j=0 


where the integers j and k = 0,1,2,..., N—1 and the f; represent a 
sampling of the function f(z) on a grid of N points that are spaced 
evenly along the x-axis, and 


N-1 
fp = — > Fp el rahi (A.2) 


k=0 


With the complementary complex exponentials in the summations, 
these clearly have the essential ingredients to be a Fourier pair. To 
see how they relate to the discussion in Chapter 2, we need to exam- 
ine them more carefully. 

The first thing to note is that both f; and F, are implicitly periodic, 
with a repeat unit of N pixels: 


f; = fj+n (A.3) 
from eqn (A.2), since 


2T +N)k/N = et 2Tik/N eŻ2Tk = E RIRN, 


J=0 j=N 


Fig. A.1 A continuous and periodic function sampled at N points per repeat unit. 
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as illustrated in Fig. A.1; likewise, 


Fk = Pein (A.4) 


from eqn (A.1). 

Although the periodicity of f; means that an independent interval 
of N consecutive pixels can be chosen anywhere along the x-axis, 
the location assigned to the point j =0 is highly significant: as in the 
continuum case, it defines the origin. For example, the substitution 


of a 6-function at 7 =0, 
1 if 7=0, 
i. = 
0 otherwise, 


into eqn (A.1) yields F =1 for all k, as would be expected from eqn 
(2.46) when f(x) =6(a). By contrast, a displacement of the non-zero 
pixel to 7 40 leads to the F;,, being complex: a uniform modulus, but 
an argument that increases linearly with k. The corresponding po- 
sition of the origin in reciprocal space is easiest to verify by putting 
k=0 in eqn (A.1). It gives 


N-1 
Fo = > ip (A.5) 
j=0 


which is the discrete analogue of eqn (2.49). 
With the periodicity of eqn (A.3), and the origin at j = 0, an even 
function is one that satisfies 


f; = f_; = fy_;. 


Hence, the symmetry relationships of eqn (2.48) become 
fn_j FN-k, 
rai NO as n=l NA (A.6) 


where we have also made use of eqn (A.4). Similarly, the discrete 
form of eqn (2.48) is 


ie = i <> FyNy-k = Fé. (A.7) 


This implies that, for a real function f;, Fo is real: 
Fo = Fy = Fğ, 


which is confirmed by eqn (A.5). Assuming that N is an even num- 
ber, and putting k= N/2 in eqn (A.7), 


Fy/2 = Fy /2 š 


Thus the Nyquist component, Fy/2, must also be real. 


A consideration of Fig. A.1 shows that the location of pixel j cor- 
responds to a value of x given by 


Tmax . 

j=2;= nw? (A.8) 
where £max is the length of the repeat unit. While £max is chosen 
at our discretion, it has to be bigger than the (finite) extent of the 
continuous function, f(x), for the calculation to be meaningful. If a 
convolution is to be performed with g(x), say, then £max needs to be 
larger than the combined spread of f(x) ® g(x) to ensure that alias- 
ing artefacts, resulting from the inherent periodicity of the sum, are 
avoided. The equivalence between the integer k and the related con- 
tinuous variable, q, can also be expressed in the manner of eqn (A.8): 


k= g = ZER, (A.9) 
where the reciprocity of Fourier length scales dictates that 


(A.10) 


max X 
Tmax 
The constant of proportionality is 27 if the continuum exponential 
is of the form exp(+igqz); it’s unity for exp(+i27qx). Equation (A.10) 
requires qualification, in that qmax must be at least twice the high- 
est ‘measured’ frequency: due to eqn (A.4), the Fourier coefficients 
pertaining to 


[k| < N/2, 


or |q| < qmax/2, define the Fy spectrum completely. According to eqn 
(A.7), only the components with 0 < k < N/2 need be specified if the 
function f; is real. 

The discrete Fourier transform (DFT) of eqns (A.1) and (A.2) can 
be evaluated very efficiently by using a fast Fourier transform (FFT) 
algorithm. Whereas the direct computation of a DFT entails O(N 2) 
operations, because each of the F; involves the sum of N products, 
the same result can be obtained with just O(N log N) calculations 
by using the clever factorization of an FFT. The associated gain in 
computational speed, of O(N/log,N), is enormous even for modest 
N. The only thing to bear in mind is that, for the most common FFT 
algorithm (Cooley and Tukey, 1965), N has to be a power-of-two: 


N = 2M, 


where M is a positive integer. This requirement can be fulfilled by 
‘padding out’ the f; function, or the F, spectrum, with zeros in an 
appropriate fashion. A suitable procedure for the example of Fig. 
A.1 is illustrated in Fig. A.2: with the increment in z, Ax, unchanged 
between the j-pixels, qmax = 1/Aaz remains the same; the larger N, 
however, means that the F, sample the Fourier spectrum much more 
finely in q. 
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j=0 j=N= 


Fig. A.2 The function in Fig. A.1 zero-padded to make N a power-of-two. 


The DFT of eqns (A.1) and (A.2), and its FFT implementation, can 
be generalized easily to many dimensions. Explicitly, the case of two 
takes the form 


N-1 N’-1 


Paw = D> So fig 128 GHING IRIN’) (A.11) 


G0 90 


and 


i Jaru 
= i2n(jk/N +9'k'/N’ 
fj = SoD ` Rupee ee) (A.12) 
k=0 k'=0 
where the integers j and 7’ relate to x and y, k and k’ correspond to 
qx and qy, and the digitization is on a grid of N x N”. 


Resonant scattering and 
absorption 


For simplicity, the X-ray form factors, neutron scattering lengths 
and SLDs were taken to be real throughout the book. While this is 
a good approximation for thermal neutrons in most cases, and often 
appropriate for X-rays, the imaginary component cannot be ignored 
in the vicinity of certain wavelengths. These correspond to energies 
at which an electron from a core shell can be ejected by an X-ray 
photon, or a compound nucleus may be created with an additional 
neutron; as such, they are associated with a high rate of X-ray and 
neutron absorption. 

The variation of f(A, 0) of eqn (3.18) in the neighbourhood of one 
of these special energies, wo, can be described by the addition of 
a complex term that mimics the behaviour of an isotropic damped 
driven harmonic oscillator close to resonance. That is to say, 


£(X,0) = fo(A, 0) + fR(A) (B.1) 


where fo(à, 0) has the characteristics discussed in Section 3.2 and 
fR(A) is the resonant contribution. Pursuing the classical analogy, 


ee ees ¢ = "t (B.2) 


is the the one-dimensional equation of motion for a harmonic oscil- 
lator, of natural frequency wo, being driven by a sinusoidal force in a 
viscous medium (7) >0). The amplitude of the steady-state response, 
x = Ae!” is easily shown to be 


1 
a ET 
wé — wf a ay 
Identifying A(w) with fr(A), subject to a scaling constant and the 
relevant transformation between frequency and wavelength, 


we —w?—inw 


(w2 — w?)? + 92 w? 


fr(A) x (B.3) 


This is plotted in Fig. B.1. 
Although an f(\,@) with a non-zero imaginary component leads to 


a breakdown of Friedel’s law of Section 7.2.1, the strong variation of 
a scattering length with à close to a resonance can be exploited to 
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[frl 


Fig. B.1 The resonant contribution of eqn (B.3), fr(A), for n=0.1wo. 


circumvent the phase problem. The method is similar to that of Sec- 
tion 7.3.3 in principle, and known as multi-wavelength anomalous 
diffraction (or MAD); details may be found in Hendrickson (1991) 
and Materlik et al. (1994). 

The absorption cross-section of eqn (3.28), Cabs, turns out to be 
proportional to Zm {fr(A)}. This is best understood by considering 
the undeflected passage of an X-ray or neutron (0 =0), of wavelength 
àin vacuo, through a uniform medium of SLD 


= Piia Na 
mM 


B OOt]; (B.4) 


where p N,/m is simply the atomic number density. Using the re- 
fractive index relations of eqns (4.25) and (4.36), the wavenumber 
inside the material is given by 


2T 
k= — — À. 
x p 


The intensity of the wave, Y = y, e!**, will then diminish exponen- 
tially with distance J, 


I= ll’ = I, g ETARA 


if Zm {G}<0. A comparison between this equation and (3.28), along 
with (B.4), yields 


Oabs — —2A’\Im {fr(A)}, (B.5) 


where we have implicitly assumed that Zm {f.(A, @)}=0. 
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